Procedures and materials for conferring disease resistance in plants

ABSTRACT

The present invention is in the field of rice genetics. More specifically, the invention relates to nucleic acid molecules from regions in the rice genome, which are associated with resistance to the fungal pathogen  Magnaporthe grisea . The invention relates to methods which employ such nucleic acid molecules to produce plants, particularly rice plants, that are resistant to infection by  Magnaporthe grisea . The invention relates to the use of such nucleic acids or fragments thereof as markers for resistance to infection with  Magnaporthe grisea  in a plant breeding program. The invention also relates to proteins encoded by such nucleic acid molecules as well as antibodies capable of recognizing these proteins.

[0001] This application claims priority under 35 USC 119 from U.S.Provisional Application Serial No. 60/352,106, filed on Jan. 25, 2002and U.S. Provisional Application Serial No. 60/353,304, filed on Feb. 1,2002, the disclosures of which are incorporated herein in theirentirety.

BACKGROUND OF THE INVENTION

[0002] Rice blast, caused by the fungus Magnaporthe grisea, is one ofthe most devastating diseases in rice. The disease occurs in most ricegrowing areas worldwide, costing farmers a loss of nearly $5 billion peryear (Moffat 1994). The high variability in M. grisea's pathogenicitymakes the control and management of rice blast difficult. In addition,resistance in many cultivars is short-lived in disease-conduciveenvironments. For the last four decades, rice geneticists and breedershave studied the genetics of blast resistance and have tried to collectnew sources of resistant germplasm to breed for durably resistantcultivars.

[0003] Genetic analysis of resistance to blast began in the early 1960swhen Goto established the differential system for races of M. grisea inJapan (On 1985). Over 20 loci for complete resistance have been mappedrelative to molecular markers on the rice molecular map (McCouch et al.1994). To elucidate the molecular mechanism(s) of blast resistance,map-based cloning of a number of blast resistance genes is beingactively pursued in several laboratories. Recently, two resistancegenes, Pib and Pita, have been successfully isolated. Pib wasintrogressed independently from two Indonesian and two Malaysiancultivars into various japonica cultivars (Yokoo et al. 1978). Thededuced amino acid sequence of the Pib gene contains a nucleotidebinding site (NBS) and leucine-rich repeats (LRRs) (Wang et al. 1999), acommon feature of many cloned plant resistance genes (Bent 1996).Interestingly, Pita also encodes a putative cytoplasmic receptor with acentrally localized nucleotide-binding site and leucine-rich domain(LRD) at the C-terminus. AVR-Pita(176) protein is shown to bindspecifically to the LRD of the Pi-ta protein, both in the yeasttwo-hybrid system and in an in vitro binding assay, indicating that theAVR-Pita(176) protein binds directly to the Pi-ta LRD region inside theplant cell to initiate a Pi-α-mediated defense response (Bryan et al.2000 and Jia et al. 2000). Comparison of the sequences of 6 resistantand 5 susceptible alleles of Pita has revealed overall amino acidpolymorphism with only one single amino acid determining specificity.

[0004] It is desirable to have additional methods and tools forproducing and identifying plants that are resistant to fungal diseases,particularly diseases caused by the fungus Magnaporthe grise.

SUMMARY OF THE INVENTION

[0005] The present invention provides isolated nucleic acids that areuseful for producing or identifying plants, particularly plants in thegrass family with resistance to diseases caused by the fungusMagnaporthe grisea. In one aspect, the nucleic acids comprise a sequencewhich encodes the NBS1 protein, the NBS2 protein, the NBS3 protein, theNBS4 protein, the NBS 5 protein, the NBS 6 protein or combinationsthereof. Preferably, such sequence is incorporated into a transgene orexpression construct which can be used to produce transgenic plantswhose genome comprises such sequence. In another aspect, the nucleicacid is a probe that comprises a sequence which specifically hybridizesto a contiguous sequence of at least 15 nucleotides in one or more ofthe following sequences SEQ ID NO. 84, SEQ ID NO. 86, SEQ ID NO. 88, SEQID NO. 90, SEQ ID NO 92, or SEQ ID NO. 94, or the complement thereof. Inanother aspect, the invention is a primer set which comprises a forwardprimer and a reverse primer that can be used in a polymerase chainreaction to amplify a unique region in the Pi9 locus. Preferably, theforward and reverse primer comprise a sequence which is identical to orthe reverse complement of a contiguous sequence of at least 15nucleotides in one or more of SEQ ID NO. 84, SEQ ID NO. 86, SEQ ID NO.88, SEQ ID NO. 90, SEQ ID NO. 92, or SEQ ID NO. 94.

[0006] The present invention also provide for methods which employ thepresent nucleic acids to provide a transgenic plant that is resistant toan infection, particular rice blast, that is caused by the fungusMagnaporthe grisea. The present invention also provides for plants andparts of the plants produced by such method. Plant parts, withoutlimitation, include seed, endosperm, ovule and pollen. In a particularlypreferred embodiment of the present invention, the plant part is a seed.

[0007] The present invention also provides methods for identifyingtransgenic or non-transgenic plants that comprise an NBS1, NBS2, NBS3,N1BS4, NBS5, or NBS6 rice blast resistant allele. The method comprisesisolating DNA or RNA from a cell of the plant and assaying for thepresence of such allele using the primers or probes of the presentinvention.

BRIEF DESCRIPTION OF THE DRAWINGS

[0008]FIG. 1. SCAR markers pB8 and pBA14 were linked to the Pi9 gene.About 20 ng of genomic DNA and pB8- and pBA14-specific primers were usedin the two PCR reactions.

[0009]FIG. 2. Southern blot analysis of pB8 (A) and 19L (B) in an F₂population. About 2 μg of DNA per sample was digested with restrictionendonuclease HindIII (in both A and B) and separated on a 1% agarose gelby electrophoresis. PCR products of pB8 and 19L were used ashybridization probes.

[0010]FIG. 3. Genetic and physical map of the Pi9 region. The geneticmap was constructed using the mapping data from 1280 F₂ plants. Thenumbers in brackets are genetic distances in centiMorgans from Pi9.Insert size is shown in parenthesis after each BAC clone. Recombinantsfor each marker were determined in 596 plants using Southernhybridization. The putative Pi9 introgression region was estimated basedon the hybridization results of pB8 and 19L that did did not hybridizewith IR31917 but did hybridize with both 75-1-127 and O. minuta.

[0011]FIG. 4. The genetic linkage map of the Pi2 and Pi9 region. Thismap is constructed based on the consensus mapping data from DH, Pi9 andPi2 mapping populations. The numbers at the right side are geneticdistances in centiMorgans. RG64 is a RFLP marker from CornellUniversity. R2123 is a RFLP marker from the Japanese Rice GenomeProject. The rest of the markers were isolated from this work. Thecomputer program Mapmaker 3.0 with kosambi function was used in the mapconstruction. FIG. 5. Southern blot hybridization of 75-1-127 (lane 1),IR31917 (lane 2), CO₃₉ (lane 3) and two recombinants (R198-10R, lane 4,and R174-11R, lane 5) with BAC end 12L. DNA was digested with both DraIand HindIII, and separated by electrophoresis on a 1% agarose gel. Themembrane was hybridized with a ³²P-labeled 12L probe. Arrow indicatesthe segregating band among the parents and two recombinants.

[0012]FIG. 6. Genomic DNA sequence at the Pi9 locus as well as sequencesof the NBS1-NBS6 open reading frames.

[0013]FIG. 7. Structure of the Pi9 locus. All the six NBS genes werepredicted by the programs including gene prediction and homology search.The string of the sequence is indicated by the gray line (labeled as:specific sequence”). The exons are indicated by the dark boxes along thegray line, and the introns by the white boxes along the line. The arrowsshow the transcription direction of the NBS genes. The numbers below theNBS genes shows the start and stop site along the sequence string. NBS4may be a pseudogene, which has some stop codons in the coding regionwhose exons ar not given. The insertion element indicated in NBS6 shows94% nucleotide sequence similarity to the LTR of the riceretrotransposon RIRE8.

[0014]FIG. 8. Sequence comparison analysis of the six NBS genes. The sixputative NBS/LRR genes were translated into proteins. The sequencecomparison program of Gap was used to compare the amino sequence of thesix putative NBS/LRR proteins and PIB, a cloned blast resistance proteinin rice.

[0015]FIG. 9. Phylogeny of six NBS genes was conducted using the programof the Phylodendronn.

[0016]FIG. 10. Disease reaction of the Pi(transgenic plants.

[0017]FIG. 11. PCR analysis of Pi9 mutant lines.

[0018]FIG. 12. cDNA sequences for NBS3 and NBS5.

DETAILED DESCRIPTION OF THE INVENTION

[0019] The present invention relates to nucleic acids that can be usedto provide and to identify plants, particularly plants in the grassfamily, more particularly rice plants, that are resistant to diseasecaused by the fungal pathogen Magnaporthe grisea. The present inventionalso relates to methods of producing transgenic plants that areresistant to such disease and to methods of identifying plants whosegenome comprises one or more NBS rice blast resistant alleles. Thenucleic acids and methods of the present invention are based, at leastin part, on applicants' discovery of certain genes in the P19 locus thatconfer resistance to rice blast, a disease caused by the fungusMagnaporthe grisea.

[0020] Pi9 confers resistance to 43 isolates collected from 14countries.

[0021] The wild rice Oryza minuta is a tetraploid species with a genomiccomposition of BBCC and is a source of resistance to both bacterialblight and rice blast (Sitch et al. 1989). Through wild hybridizationand repeated backcrossing, the resistance gene Pi9 was transferred fromOryza minuta into the elite breeding line IR31917 (Amante-Bordeos et al.1992). The introgression line, 75-1-127, was tested for resistance tomany Philippine isolates at the International Rice Research Institute(IRR1) and no compatible isolates were identified (H. Leung, personalcommunication).

[0022] A total of 43 rice blast isolates of Magnaporthe grisea collectedfrom 14 countries were used to determine if Pi9 confers resistance torice blast. To test the spectrum of the Pi9 gene, both Pi9 introgressionline 75-1-127 and the susceptible recurrent cultivar, IR31917, wereinoculated with the same set of isolates in the inoculation experiment(Table 1). 75-1-127 was highly resistant to all 43 isolates. Theseinoculation results suggest that Pi9 confers high and broad spectrumresistance to blast.

[0023] Three RAPD Markers Linked to Pi9 were Identified.

[0024] The RAPD technique (Williams et al. 1990) and bulk segregantanalysis (Michelmore et al. 1991) were used to screen for Pi9-linkedmarkers. An F₂ population was generated from a cross between 75-1-127and the susceptible cultivar, CO₃₉. Seventy-nine F₂ plants wereinoculated with Philippine blast isolate PO₆-6. Fifty-four resistantplants and twenty-five susceptible plants were identified which isconsistent with a single dominant gene segregation ratio (3:1). DNA wasextracted from 10 resistant and 10 susceptible plants.

[0025] To detect more polymorphic bands and better separation of theamplified fragments, especially those between 100 bp and 1000 bp, ³³Plabeled dCTP nucleotide was added to the PCR reaction and the reactionmix was run on a 4.5% polyacrylamide sequencing gel. After screeningwith over 900 Operon random primers, only three primers were found withreproducible polymorphic bands between the resistant and susceptiblepools. After confirmation in 54 individual resistant and 25 susceptibleF₂ plants, specific bands were excised from the dried polyacrylamidegel. The eluted DNA was used as a template to re-amplify the band withthe same random primer. After gel purification, all three fragments werecloned into the pGEM-T vector (Promega) and the correspondingrecombinant clones were named pB8, pBA14 and pBV14. After sequencing theinsert, specific primers were designed for each marker based on theirsequences. Two markers (pB8 and pBA14) were then successfully convertedinto SCAR markers since a single band (700 bp for pB8 and 400 bp forpBA14) was amplified in 75-1-127 but not in CO₃₉ (FIG. 1). Nopolymorphic band was found between resistant plant 75-1-127 and thesusceptible line CO₃₉ using primer pairs based on the sequence of markerpBV14, thus, pBV14 was not used in the succeeding experiment.

[0026] To determine the genetic distance accurately between these twoSCAR markers and Pi9, 1280 F2 plants were inoculated with isolate PO₆-6and the disease reaction of each plant was obtained. Mini-scale DNA wasisolated from each plant for PCR amplification. Five of the 1280 F₂plants showed recombination between pBA14 and Pi9, indicating that thepBA14 marker lies approximately 0.4 cM from Pi9. No recombinant plantsbetween pB8 and Pi9 were identified. Southern hybridization of 75-1-127,CO₃₉ and the O. minuta donor line (accession 101141) genomic DNArevealed that pB8 is a single copy fragment present only in theresistant plant 75-1-127 and O. minuta (FIG. 2A). pBA14 is a mediumrepetitive element and only a faint polymorphic band was observedbetween 75-1-127 and CO₃₉, which prevented its use in Southernhybridization analysis of resistant and susceptible plants (data notshown). To confirm the PCR result, DNA was extracted from, 79F₂, 55 F₃and 462 F₅ plants for Southern analysis. When pB8 was used as a probe,all HindIII digested DNA isolated from the resistant plants showed asingle band and no hybridization was detected in any of the susceptibleplants (FIG. 2A). Both PCR and Southern hybridization results suggestedthat pB8 might be a part of or tightly linked to the Pi9 gene.

[0027] Construction of a Bacterial Artificial Chromosome (BAC) Libraryof the Pi9-Introgression Line.

[0028] A BAC library was constructed using high molecular weight (HMW)DNA isolated from 75-1-127 according to the procedure described by Wanget al. (1995). Since only one size-selection was performed, the averageinsert size was about 45 kb. To maximize the chance of getting a DNAfragment containing the Pi9 gene, over 200,000 clones were collected andstored in 100 pools (about 2000 individual clones per pool). The libraryequaled approximately 21 genome equivalents based on the rice genomesize (430 Mb) and average insert size of the BAC clones (45 kb). BAC DNAwas mini-prepared from each pool.

[0029] For PCR screening of the BAC library, BAC DNA was diluted 5 timesto 20/111. For Southern hybridization, 5 μl of BAC DNA (500 ng) wasdigested with HindIII and separated on 1.0% agarose gel beforetransferring to nylon membranes. Since the fine-mapping result indicatedthat pB8 might lie within the Pi9 gene or tightly linked to the gene,the pB8 primer pairs were used to screen the 100 BAC pools. A 700 bpfragment was amplified in 12 of the 100 pools. These 12 positive poolswere then confirmed with membrane hybridization using pB8 as the probe.To identify the individual pB8-containing clones, 4000-5000 clones werestreaked on three large petri dishes from positive pool. Individualclones that hybridized with the pB8 probe were then isolated. Eachpositive clone was further confirmed by Southern hybridization. Theinsert sizes of all 12 identified clones were determined using Bio-Rad'sGene Mapper II after digestion with restriction enzyme NotI. The insertsizes were ranged from 15 kb to 80 kb.

[0030] Isolation of BAC Ends for the Construction of a BAC Contig of theIntrogression Region.

[0031] To construct a contiguous map of BAC clones covering the Pi9region, two BAC clones (BAC19 and BAC12) were selected because of theirinsert size (over 50 kb) and different HindIII-restriction digestionpatterns. A 600 bp fragment, 12R, at the right end of BAC12 (sp6 side ofthe BAC vector) was isolated after digestion with both NotI and EcoRI.12L, a 2.3 kb fragment at the left end of BAC12, was isolated whendigested with both NotI and SpeI. For BAC19, the left end consists ofthe NotI and EcoRV fragment (4.0 kb, 19L) while the right end consist ofthe NotI and SpeI fragment (3.0 kb, named 19R). To confirm whether ornot the two BACs overlap, the isolated end sequences from each BAC (theNotI and HindIII vector sequence was removed) were hybridized with BAC12and BAC19. Both 12L and 19R hybridized only with their original BACclones. 12R and 19L ends hybridized to both BAC19 and BAC12, suggestingthat these two BACs overlap and extend in opposite directions. This isconsistent with the restriction digestion patterns that they had 5identical bands when digested with XbaI (data not shown). The endsequences from two BAC clones were further characterized for copy numberby Southern hybridization. All four ends isolated from the two BACsshowed single or few bands in 75-1-127, indicating these ends aresuitable for BAC contig construction and chromosome walking.

[0032] To extend the BAC contig past the 12L end, 12L was used as probein hybridization with the membranes containing all of the BAC pool DNA.Five BAC pools were identified containing the 12L sequence. Among them,BAC3 (a 40 kb BAC) was chosen for further characterization because ofits minimum overlapping with BAC12 based on restriction enzyme digestionpatterns (data not shown). Both BAC3 end sequences were obtained usingthe same method used for the isolation of the BAC12 and BAC19 ends. TheBAC3 right end sequence is 3.8 kb and left end sequence is 2.4 kb.Southern hybridization confirmed that 3R overlaps with BAC12 but notwith BAC19. Both 3R and 3L ends hybridized with both 75-1-127 andIR31917 (data not shown). From this information, a 100 kb BAC contigcomprised of BAC19, BAC12 and BAC3 was constructed (FIG. 3).

[0033] Pi9 is Located on Chromosome 6 Between RFLP Marker RG64 andR2131.

[0034] To map the Pi9-linked markers and BAC ends on the rice molecularlinkage map, a doubled haploid (DH) mapping population derived from across between IR64 and Azucena (Huang et al. 1994) was used. The genomicDNA of IR64 and Azucena was first digested with 8 restriction enzymes(BamHI, BglII, DraI, EcoRI, HindIII, PstI and XbaI). When pB8 was usedas a probe, no hybridization band was observed although the blot wasexposed to X-ray film for 5 days. Another marker, pBA14, containedrepetitive sequences and showed no polymorphism between the two mappingparents. Therefore, neither marker is preferred for mapping in thepopulation.

[0035] To map the Pi9-linked BAC ends on the linkage map, a parentalpolymorphism survey was conducted using 18 different restriction enzymesfor all 6 BAC ends. Polymorphism between the two mapping parents wasdetected with only 4 ends (12R, 12L, 3R and 3L) for at least onerestriction enzyme. No polymorphism was detected at the 19R locus. LikepB8, no hybridization signal was found when 19L was used as a probe. Anappropriate enzyme was selected to digest the 111 DH lines for each BACend. Mapping data analysis indicated that all 4 sequences were mappedonto chromosome 6 between RFLP markers RG64 and R2131 (FIG. 4).

[0036] Establishing a High-Resolution Map at the Pi9 Locus.

[0037] To construct a high-resolution map at the Pi9 locus, a total of596 plants (79 F₂, 55 F₃ and 462 F₅ plants) were used in hybridizationwith one RAPD marker (pB8) and six BAC ends (12L, 12R, 19L, 19R, 3L and3R). Among them, 340 plants were resistant and 256 plants weresusceptible. Each marker was hybridized with the parental survey blotsthat contained 75-1-127 and CO₃₉ DNA digested with 14 enzymes. Anappropriate enzyme showing polymorphism between the two parents for eachmarker was used to digest all 596 plants. End 19R was not used in theexperiment since it did not show any polymorphism between 75-1-127 andCO₃₉ for 20 enzymes, again confirming that it may be outside theintrogression region. Hybridization results indicated that norecombinants were found between Pi9 and either 12R, 19L, or pB8. For 3R,only one recombinant (plant R198-10R) was identified. Two recombinants(R198-10R and R174-11R, resistant phenotype and susceptible genotype)were found when end sequences 12L and 3L were used as probes. Thehybridization result of 12L with the two recombinants is shown in FIG.5. Based on these mapping results, the Pi9 gene was mapped between theBAC ends 19R and 3R (FIG. 3 and FIG. 4).

[0038] Sequence Analysis of a 76 kb Fragment at the Pi9 Locus

[0039] From the hybridization results, it was confirmed that BAC12 andBAC3 overlaps (FIG. 3). To obtain the sequence information at the Pi9locus, both BAC12 and BAC3 were fully sequenced using a short gunmethod. Purified plasmids of the two BACs were sonicated, separately,using a sonicator. Sheared DNA fragment was then size-selected on aagarose gel, damaged DNA ends were repaired using a T4 polymerase(Roche), and ligated to a pBluescript (KS) vector. About 700 individualclones from the BAC12 shotgun library and 450 individual clones from theBAC3 shotgun library were sequenced from both ends. The sequenceanalysis program Phred/Phrap was used to assemble all sequence data.Sequence analysis showed BAC12 is 58 kb and BAC3 is 40 kb. About 18 kbwas overlapped between these two BACs. The total length of the DNAfragment from the two BAC is 76,272 bp (FIG. 6).

[0040] Identification of a NBS/LRR Gene Cluster

[0041] To identify the open reading frame (ORF) accurately from thegenomic sequence, two different approaches were used. Firstly, the geneprediction program of GenScan1.0 was used to analyze the coding sequence(CDS) in the 76 kb region. Secondly, the homology search using BLASTprogram was used to modify the gene prediction result. A total of sevenputative genes were identified. The first gene from the sp6 end of BAC12is homologous to maize nitric induced gene. The other six genes (namedNBS1-NBS6) are candidate genes of Pi9 since all of them show highhomology to NBS/LRR type disease resistance genes cloned in plantspecies (FIG. 7, Bent, 1996). The exact position of each NBS/LRR gene isshown in FIG. 7. Among the six Pi9 candidate genes, NBS3 and NBS2 wereconfirmed with the partial sequence of the relative cDNA. It seems theNBS6 is not complete. This gene is also truncated in the 5′ region by aninsertion of a solo-LTR, which shows 94% of identity in nucleotidesequence to the LTR of rice gypsy-type retrotransposon, RIRE8. Thissolo-LTR shows typical feature including duplicated target sequence ofGACCG and inverted sequence of TGTCAC.

[0042] Sequence Comparison Analysis of Six Candidate Genes

[0043] The six putative NBS/LRR genes were translated into proteinsequence. The sequence comparison program of Gap was used to compare theamino sequence of the six putative NBS/LRR proteins and PIB, a clonedblast resistance in rice (Wang et al. 1999). The identity and similarityof all the NBS/LRR proteins were shown in Table 2 and the alignment ofthe candidate genes are shown in FIG. 8. The NBS2 and NBS5 shows 98% ofidentity in amino acid sequence each other and NBS4 and NBS6 shows 93%.All the six NBS/LRR protein found in Pi9 locus show higher than 28%identity in amino acid sequence to PIB protein. The multiple sequencealignment of the six NBS/LRR proteins was done by the program ofClustalw (accurate), which was shown in FIG. 7. A phylogeny analysis ofsix candidate genes was conducted using the program of the Phylodendronn(FIG. 9). The analysis revealed a similar result on the relationship ofthe six candidate genes with that obtained using the Gap program.

[0044] A Possible Duplication Event in the NBS/LRR Gene Cluster

[0045] Based on the sequence identity in the nucleotide sequence, twosequence fragments with high homology each other were identified (FIG.6). The sequence from 38882 bp to 44118 bp shows 98% of identity to thesequence from 61740 bp to 66982, which are corresponding to the NBS2 andNBS5 separately. The sequence from 46029 bp to 49812 bp shows 94% ofidentity to the sequence from 68294 bp to 76251 bp which arecorresponding to the NBS4 and NBS6 separately. The high identity of thegenomic DNA region in Pi9 locus imply that one duplication eventoccurred during the evolution of this resistance gene locus. Theretrotransposon inserted into the NBS6 gene may have occurred after theduplication event.

[0046] Fine-Mapping of the Pi9 Locus with NBS/LRR Genes

[0047] To pinpoint the Pi9 gene in the BAC contig, all the NBS/LRRcandidate genes were used in Southern hybridization with a total of 596plants (79 F₂, 55 F₃ and 462 F₅ plants). No recombination was observedbetween either NBS1, NBS2 or NBS3 with Pi9. Only one recombinant wasfound between NBS4 and Pi9 whereas two recombinants were found betweenNBS5 and NBS6 with Pi9. These results indicate that the Pi9 gene liesbetween BAC end 12R and candidate gene NBS4 (FIG. 7).

[0048] Screen for cDNAs Clone from the Pi9 cDNA Library

[0049] To isolate cDNA clones at the Pi9 locus, a cDNA library was madeusing the total RNA isolated from the infected leaf tissues of the Pi9line 75-1-127. Leaf tissues were harvested at 12 and 24 hrs afterinoculated with blast isolate PO₆-6. Equal amount of mRNAs from the twotime points were mixed for the first strand cDNA synthesis. Detailedprocedures were followed according to the manufacturers instruction(GIBCO-BRL, USA). About 7.2 million individual clones with averageinsert size of 1.5 kb were stored in 200 384-well plates.

[0050] Using both NBS1 and NBS2 as probes in colony hybridization, twocDNA clones were identified. These two clones were fully sequenced.BLAST search indicated that both genes had some homology with knownNBS/LRR disease resistance genes. Sequence analysis revealed that one ofthe cDNAs (1.8 kb) matched with NBS3 and another cDNA (2.3 kb) matchedwith NBS5. Both clones are not full length based on the predicted ATGsite in the genomic sequence.

[0051] Transformation of BAC 12 into Susceptible Cultivar TP309

[0052] When BAC12 was digested with restriction enzyme NotI, twofragments (45 kb and 13 kb, respectively) were released from the clone.The 45-kb fragment contains NBS1, NBS2 and NBS3 while the 13 kb fragmentcontains NBS4. These two NotI fragments are cloned into our newlyconstructed pTAC8 vector. Recombinant clones as TAC45 and TAC13 weretransferred to Agrobacterium strain LBA4404. TP309 was transformed withthese two constructs using the procedure established in our lab (Yin andWang, 1999). Sixty and fifty independently transformed lines weregenerated from the transformation of TAC45 and TAC, respectively. About3 TI plants from each line were transplanted in pots with soil and keptin greenhouse. Plants were selfed to produce T2 seeds.

[0053] Disease Evaluation of Transgenic Plants

[0054] About 10-15 T2 seeds were sowed in trays and plants were growingin growth chamber. Eighteen days old plants (at 4 leaf stage) wereinoculated with rice blast isolate PO6-6. Disease reaction was scored 7days after inoculation based on a 0-5 scoring system. Inoculationresults showed that all transgenic line transformed with TAC13 werehighly susceptible to the isolate. In TAC45 transgenic lines, only oneline (TAC106) showed segregation of resitance and susceptibility toblast. Among the 12 inoculated plants, 10 plants were resistance and 2plant were susceptible (FIG. 10).

[0055] Small scale DNA was extracted from each plant. Primer pairs fromNBS1, NBS2, NBS3 were used to check if these genes are present in theplants. The PCR result confirmed that NBS1, NBS2 and NBS3 were presentin the resistant plants. Southern hybridization method will be used whenenough leaf tissue for DNA extraction is available.

[0056] Mutant Generation from the Pi9 Plants

[0057] To identify mutants at the Pi9 locus or in the Pi9-mediatedresistance pathway, about 20,000 75-1-127 seeds (carrying the Pi9 gene)was treated with the chemical mutagen DEB. Seeds were divided into twoparts and treated with the chemical at concentrations of 0.04 and 0.06%,respectively. About 70% of the germination rate from 0.06% treatment and80% of germination rate from 0.04% treatment were observed.Approximately, seed from 12,000 M1 plants were harvested.

[0058] Bulk M2 seeds of the mutant population were sowed in a soil.Three weeks old plants were inoculated with PO6-6. Plants with visiblelesions were picked 6 days after inoculation. Selfed seeds wereharvested from the putative susceptible M2 plants. To confirm thedisease reaction of the selected plants, inoculation was carried out inM3 generation. Lines showed typical susceptible lesions are transplantedand DNA was extracted from each plant for PCR and Southern analysis. PCRanalysis with 5 NBS genes showed that NBS2 and NBS3 were deleted in allthe susceptible mutant lines (FIG. 11).

DEFINITIONS

[0059] By “cDNA” is meant DNA that is complementary to and derived froma mRNA.

[0060] By “complementarity” is meant a nucleic acid that can formhydrogen bond(s) with other nucleic acid sequences either throughtraditional Watson-Crick or other non-traditional types of base pairedinteractions.

[0061] By “constitutive promoter” is meant promoter elements that directcontinuous gene expression in all cell types and at all times (i.e.,actin, ubiquitin, CaMV ³⁵S, 35T, and the like).

[0062] By “developmental specific” promoter is meant promoter elementsresponsible for gene expression at specific plant developmental stages,such as in early or late embryogenesis and the like.

[0063] By “enhancer” is meant nucleotide sequence elements which canstimulate promoter activity such as those from maize streak virus (MSV)protein leader sequence, alfalfa mosaic virus protein leader sequence,alcohol dehydrogenase intron 1, and the like.

[0064] By “expression” as used herein, is meant the transcription andstable accumulation of mRNA inside a plant cell. Expression of genesalso involves transcription of the gene to create mRNA and translationof the mRNA into precursor or mature proteins.

[0065] By “foreign” or “heterologous gene” is meant a gene encodinga-protein whose exact amino acid sequence is not normally found in thehost cell, but is introduced by standard gene transfer techniques.

[0066] By “gene” is meant to include all genetic material involved inprotein expression including chimeric DNA constructions, genes, plantgenes and portions thereof, and the like.

[0067] By “genome” is meant genetic material contained in each cell ofan organism and/or virus and the like.

[0068] By “inducible promoter” is meant promoter elements which areresponsible for expression of genes in response to a specific signalsuch as: physical stimuli (heat shock genes); light (RUBP carboxylase);hormone (Em); metabolites, chemicals, stress and the like.

[0069] By “plant” is meant a photosynthetic organism including botheukaryotes and prokaryotes.

[0070] By “promoter regulatory element” is meant nucleotide sequenceelements within a nucleic fragment or gene which controls the expressionof that nucleic acid fragment or gene. Promoter sequences provide therecognition for RNA polymerase and other transcriptional factorsrequired for efficient transcription. Promoter regulatory elements froma variety of sources can be used efficiently in plant cells to expressgene constructs. Promoter regulatory elements are also meant to includeconstitutive, tissue-specific, developmental-specific, induciblepromoters and the like. Promoter regulatory elements may also includecertain enhancer sequence elements and the like that improvetranscriptional efficiency.

[0071] By “tissue-specific” promoter is meant promoter elementsresponsible for gene expression in specific cell or tissue types, suchas the leaves or seeds (i.e., zein, oleosin, napin, ACP, globulin andthe like).

[0072] By “transformation” is meant a process of introducing anexogenous DNA sequence (e g., a vector, a recombinant DNA molecule) intoa cell or protoplast in which that exogenous DNA is incorporated into achromosome or is capable of autonomous replication.

[0073] By “transformed cell” is meant a cell whose DNA has been alteredby the introduction of an exogenous DNA molecule into that cell.

[0074] By “transgene” is meant an exogenous gene which when introducedinto the genome of a host cell through a process such as transformation,electroporation, particle bombardment, and the like, is expressed by thehost cell and integrated into the cells genome such that the trait ortraits produced by the expression of the tansgene is inherited by theprogeny of the transformed cell.

[0075] By “transgenic cell” is meant any cell derived or regeneratedfrom a transformed cell or derived from a transgenic cell. Exemplarytransgenic cells include plant calli derived from a transformed plantcell and particular cells such as leaf, root, stem, e.g., somatic cells,or reproductive (germ) cells obtained from a transgenic plant.

[0076] By “transgenic plant” is meant a plant or progeny thereof derivedfrom a transformed plant cell or protoplast, wherein the plant DNAcontains an introduced exogenous DNA molecule not originally present ina native, non-trarsgenic plant of the same strain. The tenms “transgenicplant” and “transformed plant” have sometimes been used in the art assynonymous terms to define a plant whose DNA contains an exogenous DNAmolecule. However, it is thought more scientifically correct to refer toa regenerated plant or callus obtained from a transformed plant cell orprotoplast as being a transgenic plant, and that usage will be followedherein.

[0077] By “vector” is meant a DNA molecule capable of replication in ahost cell and/or to which another DNA segment can be operatively linkedso as to bring about replication of the attached segment. A plasmid isan exemplary vector.

[0078] Nucleic Acid Molecules

[0079] Nucleic acid molecules of the present invention include, withoutlimitation, nucleic acid molecules having a nucleic acid sequenceselected from the group consisting of SEQ ID NOs: 1-84, 86, 88, 90, 92,94, 96, and 97 and complements thereof. A subset of the nucleic acidmolecules of the present invention includes nucleic acid molecules thatencode the NBS1, NBS2, NBS3, NBS4, NBS5, or NBS6 protein or a variantthereof. Such variants comprise an amino acid sequence which is at least90% identical to SEQ ID NO. 85, SEQ ID NO. 87, SEQ ID NO. 89, SEQ ID NO.91, SEQ ID NO. 93, or SEQ ID NO. 95. The amino acids which are notidentical, preferably, are conservative amino acid substitutions.

[0080] Another subset of the nucleic acid molecules of the presentinvention includes nucleic acid molecules that can be used as probes orprimers for selecting or identifying plants whose genome comprises anNBS rice blast resistant allele. A list of such primers is attached tothis application. The listed primers were designed based on the genomicsequences from BAC clones, DNA markers and other genomic clones. Theseprimers can be used in gene amplification and marker-aided selection.

[0081] Fragment nucleic acid molecules may comprise significantportion(s) of, or indeed most of, these nucleic acid molecules. Inpreferred embodiments, the fragments may comprise smallerpolynucleotides, e.g., oligonucleotides having from about 20 to about250 nucleotide residues and more preferably, about 40 to about 100nucleotide residues. Such fragments are useful as probes for identifyingplants whose genome includes an NBS rice blast resistant allele. Inanother preferred embodiment, fragment molecules may be at least 15nucleotides and are useful as primers for identifying or selectingplants whose genome includes an NBS rice resistant allele.

[0082] The nucleic acids may be single-stranded or double stranded. Suchnucleic acids may be DNA or RNA molecules

[0083] The term “isolated,” as used herein, refers to a moleculeseparated from substantially all other molecules normally associatedwith it in its native state. More preferably an isolated molecule is thepredominant species present in a preparation. An isolated molecule maybe greater than 60% free, preferably 75% free, more preferably 90% free,and most preferably 95% free from the other molecules (exclusive ofsolvent) present in the natural mixture. The term “isolated” is notintended to encompass molecules present in their native state.

[0084] It is understood that the nucleic acids of the present invention,particularly the probes and primers, may be labeled with reagents thatfacilitate detection of the agent, e.g., fluorescent labels, (Prober etal., Science 238:336-340 (1987); Albarella et al., EP 144914), chemicallabels, (Sheldon et al., U.S. Pat. No. 4,582,789; Albarella et al., U.S.Pat. No. 4,563,417), and modified bases, (Miyoshi et al., EP 119448)including nucleotides with radioactive elements, e.g., .sup.32P,.sup.33P, .sup.35S or .sup.1251, such as .sup.32P dCTP.

[0085] It is further understood, that the present invention providesrecombinant bacterial, animal, fungal and plant cells, plasmid and viralconstructs comprising the isolated nucleic acids of the presentinvention.

[0086] Nucleic acid molecules or fragments thereof of the presentinvention are capable of specifically hybridizing to other nucleic acidmolecules under certain circumstances. As used herein, two nucleic acidmolecules are said to be capable of specifically hybridizing to oneanother if the two molecules are capable of forming an anti-parallel,double-stranded nucleic acid structure. A nucleic acid molecule is saidto be the “complement” of another nucleic acid molecule if they exhibit“complete complementarity,” i.e., each nucleotide in one sequence iscomplementary to its base pairing partner nucleotide in anothersequence. Two molecules are said to be “minimally complementary” if theycan hybridize to one another with sufficient stability to permit them toremain annealed to one another under at least conventional“low-stringency” conditions. Similarly, the molecules are said to be“complementary” if they can hybridize to one another with sufficientstability to permit them to remain annealed to one another underconventional “high-stringency” conditions. Nucleic acid molecules whichhybridize to other nucleic acid molecules, e.g., at least under lowstringency conditions are said to be “hybridizable cognates” of theother nucleic acid molecules. Conventional stringency conditions aredescribed by Sambrook et al., Molecular Cloning, A Laboratory Manual,2nd Ed., Cold Spring Harbor Press, Cold Spring Harbor, N.Y. (1989) andby Haymes et al., Nucleic Acid Hybridization, A Practical Approach, IRLPress, Washington, D.C. (1985). Departures from complete complementarityare therefore permissible, as long as such departures do not completelypreclude the capacity of the molecules to form a double-strandedstructure. Thus, in order for a nucleic acid molecule to serve as aprimer or probe it need only be sufficiently complementary in sequenceto be able to form a stable double-stranded structure under theparticular solvent and salt concentrations employed.

[0087] Appropriate stringency conditions which promote DNAhybridization, for example, 6.0.times. sodium chloride/sodium citrate(SSC) at about 45.degree. C., followed by a wash of 2.0.times. SSC at50.degree. C., are known to those skilled in the art or can be found inCurrent Protocols in Molecular Biology, John Wiley & Sons, N.Y. (1989),6.3.1-6.3.6. For example, the salt concentration in the wash step can beselected from a low stringency of about 2.0.times. SSC at 50.degree. C.to a high stringency of about 0.2.times. SSC at 50.degree. C. Inaddition, the temperature in the wash step can be increased from lowstringency conditions at room temperature, about 22.degree. C., to highstringency conditions at about 65.degree. C. Both temperature and saltmay be varied, or either the temperature or the salt concentration maybe held constant while the other variable is changed.

[0088] In a preferred embodiment, a nucleic acid of the presentinvention will specifically hybridize to at least 15 contiguousnucleotides in one or more of the nucleic acid molecules set forth inSEQ ID NO: 83, 85, 87, 89, 91, 93, or 95 or complements thereof undermoderately stringent conditions, for example at about 2.0.times. SSC andabout 65.° C.

[0089] In a particularly preferred embodiment, a nucleic acid of thepresent invention will include those nucleic acid molecules thatspecifically hybridize to at least 15 contiguous nucleotides in one ormore of the nucleic acid molecules set forth in SEQ ID NO: 85, 87, 89,91, 93, or 95 or complements thereof under high stringency conditionssuch as 0.2.times. SSC and about 65° C.

[0090] In one aspect of the present invention, the nucleic acidmolecules of the present invention comprise one or more of the nucleicacid sequences set forth in SEQ ID NOs. 84, 86, 88, 90, 92, or 94 orcomplements thereof or fragments of either. In another aspect of thepresent invention, one or more of the nucleic acid molecules of thepresent invention share at least 90% sequence identity with one or moreof the nucleic acid sequences set forth in SEQ ID NO: 84, 86, 88, 90,92, or 94 or complements thereof.

[0091] As used herein “sequence identity” refers to the extent to whichtwo optimally aligned polynucleotide or peptide sequences are invariantthroughout a window of alignment of components, e.g., nucleotides oramino acids. An “identity fraction” for aligned segments of a testsequence and a reference sequence is the number of identical componentswhich are shared by the two aligned sequences divided by the totalnumber of components in reference sequence segment, i.e., the entirereference sequence or a smaller defined part of the reference sequence.“Percent identity” is the identity fraction times 100.

[0092] Useful methods for determining sequence identity are disclosed inGuide to Huge Computers, Martin J. Bishop, ed., Academic Press, SanDiego, 1994, and Carillo, H., and Lipton, D., SIAM J Applied Math (1988)48:1073. More particularly, preferred computer programs for determiningsequence identity include the Basic Local Alignment Search Tool (BLAST)programs which are publicly available from National Center BiotechnologyInformation (NCBI) at the National Library of Medicine, NationalInstitute of Health, Bethesda, Md. 20894; see BLAST Manual, Altschul etal., NCBI, NLM, NIH; Altschul et al., J. Mol. Biol. 215:403-410 (1990);version 2.0 or higher of BLAST programs allows the introduction of gaps(deletions and insertions) into alignments; BLASTX can be used todetermine sequence identity between a polynucleotide sequence query anda protein sequence database; and, BLASTN can be used to determinesequence identity between between sequences.

[0093] For purposes of this invention “percent identity” shall bedetermined using BLASTX version 2.0.14 (default parameters), BLASTNversion 2.0.14, or BLASTP 2.0.14.

[0094] The isolated nucleic acid molecules that encode SEQ ID NOs 85,87, 89, 91, 93, and 95 can be used to produce NBS 1, NBS 2, NBS 3, NBS4,NBS 5, and NBS6 proteins using any of a variety of methods known tothose skilled in the art. The amino acid sequence of the NBS1 protein isshown in FIG. 8 and set forth in SEQ ID NO. 85. One embodiment of anucleotide sequence encoding the NBS1 protein is shown in Fig. and setforth in SEQ ID NO. 84. The amino acid sequence of the NBS2 protein isshown in FIG. 8 and set forth in SEQ ID NO. 87. One embodiment of anucleotide sequence encoding the NBS2 protein is shown in FIG. 8 and setforth in SEQ ID NO. 86. The amino acid sequence of the NBS3 protein isshown in FIG. 8 and set forth in SEQ ID NO 89. One embodiment of anucleotide sequence encoding the NBS3 protein is shown in Fig. and setforth in SEQ ID NO. 88. The amino acid sequence of the NBS4 protein isshown in FIG. 8 and set forth in SEQ ID NO 91. One embodiment of anucleotide sequence encoding the NBS4 protein is shown in FIG. 8 and setforth in SEQ ID NO. 90. The amino acid sequence of the NBS6 protein isshown in FIG. 8 and set forth in SEQ ID NO 95. One embodiment of anucleotide sequence encoding the NBS5 protein is shown in FIG. 8 and setforth in SEQ ID NO. 94. The proteins may be used for various purposes.One purpose for the proteins is as antigens to cause production ofantibodies that react with the proteins. The present inventionencompasses such antibodies.

[0095] Modification and changes may be made in the structure of thepeptides of the present invention and DNA segments which encode them andstill obtain a functional molecule that encodes a protein or peptidewith desirable characteristics. The following is a discussion based uponchanging the amino acids of a protein to create an equivalent, or evenan improved, second-generation molecule. In particular embodiments ofthe invention, mutated proteins are contemplated to be useful forincreasing the rice blast disease resistance activity of the protein,and consequently increasing the activity and/or expression of therecombinant transgene in a plant cell. The amino acid changes may beachieved by changing the codons of the DNA sequence, according to thecodons given in Table 3. TABLE 3 Amino Acid Codons Alanine Ala A GCA GCCGCG GCU Cysteine Cys C UGC UGU Aspartic acid Asp D GAC GAU Glutamic acidGlu E GAA GAG Phenylalanine Phe F UUC UUU Glycine Gly G GGA GGC GGG GGUHistidine His H CAC CAU Isoleucine Ile I AUA AUC AUU Lysine Lys K AAAAAG Leucine Leu L UUA UUG CUA CUC CUG CUU Methionine Met M AUGAsparagine Asn N AAC AAU Proline Pro P CCA CCC CCG CCU Glutamine Gln QCAA CAG Arginine Arg R AGA AGG CGA CGC CGG CGU Serine Ser S AGC AGU UCAUCC UCG UCU Threonine Thr T ACA ACC ACG ACU Valine Val V GUA GUC GUG GUUTryptophan Trp W UGG Tyrosine Tyr Y UAC UAU

[0096] For example, certain amino acids may be substituted for otheramino acids in a protein structure without appreciable loss ofinteractive binding capacity with structures such as, for example,antigen-binding regions of antibodies or binding sites on substratemolecules. Since it is the interactive capacity and nature of a proteinthat defines that protein's biological functional activity, certainamino acid sequence substitutions can be made in a protein sequence,and, of course, its underlying DNA coding sequence, and neverthelessobtain a protein with like properties. It is thus contemplated by theinventors that various changes may be made in the peptide sequences ofthe disclosed compositions, or corresponding DNA sequences which encodesaid peptides without appreciable loss of their biological utility oractivity.

[0097] In making such changes, the hydropathic index of amino acids maybe considered. The importance of the hydropathic amino acid index inconferring interactive biologic function on a protein is generallyunderstood in the art (Kyte and Doolittle, 1982, incorporate herein byreference). It is accepted that the relative hydropathic character ofthe amino acid contributes to the secondary structure of the resultantprotein, which in turn defines the interaction of the protein with othermolecules, for example, enzymes, substrates, receptors, DNA, antibodies,antigens, and the like.

[0098] Each amino acid has been assigned a hydropathic index on thebasis of their hydrophobicity and charge characteristics (Kyte andDoolittle, 1982), these are: isoleucine (+4.5); valine (+4.2); leucine(+3.8); phenylalanine (+2.8); cysteine/cystine (+2.5); methionine(+1.9); alanine (+1.8); glycine (p.4); threonine (−7); serine (−8);trtyptophan (−0.9); tyrosine (−1.3); proline (−1.6); histidine (−3.2);glutamate (−3.5); glutamine (−3.5); aspartate (−3.5); asparagine (−3.5);lysine (−3.9); and arginine (−4.5).

[0099] It is known in the art that certain amino acids may besubstituted by other amino acids having a similar hydropathic index orscore and still result in a protein with similar biological activity,ie., still obtain a biological functionally equivalent protein. Inmaking such changes, the substitution of amino acids whose hydropathicindices are within .+−0.2 is preferred, those which are within .+−.1 areparticularly preferred, and those within .+−.0.5 are even moreparticularly preferred.

[0100] It is also understood in the art that the substitution of likeamino acids can be made effectively on the basis of hydrophilicity. U.S.Pat. No. 4,554,101, incorporated herein by reference, states that thegreatest local average hydrophilicity of a protein, as governed by thehydrophilicity of its adjacent amino acids, correlates with a biologicalproperty of the protein.

[0101] As described in U.S. Pat. No. 4,554,101, the followinghydrophilicity values have been assigred to amino acid residues:arginine (+3.0); lysine (+3.0); aspartate (+3.0.+−0.1); glutamate(+3.0.+−0.1); serine (+0.3); asparagine (+0.2); glutamine (+0.2);glycine (O); threonine (−0.4); proline (−0.5.+−0.1); alanine (−0.5);histidine (−0.5); cysteine (−1.0); methionine (−1.3); valine (−1.5);leucine (−1.8); isoleucine (−1.8); tyrosine (−2.3); phenylalanine(−2.5); tryptophan (−3.4).

[0102] It is understood that an amino acid can be substituted foranother having a similar hydrophilicity value and still obtain abiologically equivalent, and in particular, an immunologicallyequivalent protein. In such changes, the substitution of amino acidswhose hydrophilicity values are within .+−0.2 is preferred, those whichare within .+−0.1 are particularly preferred, and those within .+−.0.5are even more particularly preferred.

[0103] As outlined above, amino acid substitutions are generallytherefore based on the relative similarity of the amino acid side-chainsubstituents, for example, their hydrophobicity, hydrophilicity, charge,size, and the like. Exemplary substitutions which take variots of theforegoing characteristics into consideration are well known to those ofskill in th. art and include: arginine and lysine; glutamate andaspartate; serine and threonine; glutamine and asparagine; and valine,leucine and isoleucine.

[0104] Site-specific mutagenesis is a technique useful in thepreparation of individual peptides, or biologically functionalequivalent proteins or peptides, through specific mutageneis of theunderlying DNA. The technique further provides a ready ability toprepare aid test sequence variants, for example, incorporating one ormore of the foregoing considerations, by introducing one or morenucleotide sequence changes into the DNA. Site-specific mutagenesisallows the production of mutants through the use of specificoligonucleotide sequences which encode the DNA sequence of the desiredmutation, as well as a sufficient number of adjacent nucleotides, toprovide a primer sequence of sufficient size and sequence complexity toform a stable duplex on both sides of the deletion junction beingtraversed.

[0105] In general, the technique of site-specific mutagenesis is wellknown in the art, as exemplified by various publications. As will beappreciated, the technique typically employs a phage vector which existsin both a single stranded and double stranded form. Typical vectorsuseful in site-directed mutagenesis include vectors such as the M13phage. These phage are readily commercially available and their use isgenerally well known to those skilled in the art. Double strandedplasmids are also routinely employed in site directed mutagenesis whicheliminates the step of transferring the gene of interest from a plasmidto a phage.

[0106] The preparation of sequence variants of the selectedpeptide-encoding DNA segments using site-directed mutagenesis isprovided as a means of producing potentially useful species and is notmeant to be limiting as there are other ways in which sequence variantsof peptides and the DNA sequences encoding them may be obtained. Forexample, recombinant vectors encoding the desired peptide sequence maybe treated with mutagenic agents, such as hydroxylamine, to obtainsequence variants.

[0107] Concerning the amino acid sequences of the inventive proteinsdisclosed herein, variants of those proteins are also encompassed withinthe scope of this invention. Such encompassed variants have at least 90%amino acid sequence identity with one or more of the inventive proteinsdisclosed herein. Such variants include, for instance, proteins whereinone or more amino acid residues are added or deleted at the N- orC-terminus of the sequence of the disclosed protein sequences or one ormore amino acid residues within the disclosed protein sequences aresubstituted, preferably with a conservative amino acid. Ordinarily, thedisclosed protein sequence variants will have at least about 90% aminoacid sequence identity, more preferably at least about 95% amino acidsequence identity, with the amino acid sequence of the disclosedproteins. Percent (%) amino acid sequence identity with respect to thesequence herein is defined as the percentage of amino acid residues in acandidate sequence that are identical with the amino acid residues inthe disclosed protein sequences, after aligning the sequences andintroducing gaps, if necessary, to achieve the maximum percent sequenceidentity, and not considering any conservative substitutions as part ofthe sequence identity. Alignment for purposes of determining percentamino acid sequence identity can be achieved in various ways that arewithin the skill in the art, for instance, using publicly availablecomputer software such as ALIGN.TM. or Megalign (DNASTAR) software.Those skilled in the art can determine appropriate parameters formeasuring alignment, including any algorithms needed to achieve maximalalignment over the full length of the sequences being compared.

[0108] Preferably, the deletions and additions are located at the aminoterminus, the carboxy terminus, or both, of the disclosed proteinsequences. Amino acid substitutions are generally based on the relativesimilarity of the amino acid side-chain substituents, for example, theirhydrophobicity, hydrophilicity, charge, size, and the like.

[0109] It is known in the art that certain amino acids may besubstituted by other amino acids having a similar hydropathic indexscore and a similar hydrophilicity value and still result in a proteinwith similar biological activity, i.e., still obtain a biologicalfunctionally equivalent protein. It is also understood in the art thatthe substitution of like amino acids can be made effectively on thebasis of hydrophilicity. U.S. Pat. No. 4,554,101, incorporated herein byreference, states that the greatest local average hydrophilicity of aprotein, as governed by the hydrophilicity of its adjacent amino acids,correlates with a biological property of the protein.

[0110] Transgenic Plants Comprising an NBS Rice Blast Resistant Allele

[0111] The present invention provides a method of producing a transgenicplant whose genome comprises an NBS rice resistant allele. The methodcomprises introducing a transgene or DNA construct comprising a nucleicacid that encodes an NBS 1 protein, an NBS2 protein, an NBS3 protein, anNBS4 protein, an NBS5 protein, and NBS6 protein or combinations thereofand a promoter which is operably linked to said nucleic acid into aplant cell or protoplast, and regenerating a plant from said plant cellor said protoplast.

[0112] Another aspect of the invention comprises a transgenic plantwhose genome comprises a transgene or DNA expression construct thatencodes and expresses the present NBSI, NBS2, NBS32, NBS4, NBS5, or NBS6protein.

[0113] Types of Plants in Which the Invention can be Used

[0114] The transgene of the present invention can be introduced in avariety of non-transgenic host plants. The grasses are one family ofplants that can be used to produce the present transgenic plants,however, the invention may be used in other families of plants. Thegrass family comprises the plants in the family Gramineae (also calledPoaceae). This family comprises plants such as maize, wheat, rice,barley, turfgrass, ryegrass, stall fascue grass, other turf plants,orghum, rye, and sugar cane. The plants comprising rice is preferablyused to practice the invention. One genus in the rice family is Oryza.Within the genus Oryza, a variety of species are found. These speciescomprise Octopus vulgaris, Onchocerca cervicalis, Onchocerca volvulus,Oryctolagus cuniculus, Oryza australiensis, Oryza brachyantha, Oryzalatifolia, Oryza minuta, Oryza nivarra, Oryza officinalis, Oryzapunctata, Oryza rufipogon, Oryza sativa, Oryza longistaminata, Oryzaglaberrima, Oryza eichingeri, Oryza grandiglumis, Oryza perennis, Oryzaglumaepatula, Oryza meridionalis, Oryza alta and other genera withinthis species. All of these species may be used in practice of thepresent invention.

[0115] Types of Diseases and Infective Organisms the Invention Can beUsed to Prevent

[0116] The present invention, especially the genes of the presentinvention, are used to make strains of plants that are resistance toparticular diseases, the diseases being caused by infective organisms.The genes and proteins of the present invention can be used to preventinfection of any infective organisms to which the inventive genes andproteins provide protection. Preferably, the present invention is usedto prevent, lessen the severity of, or lessen the occurrence of riceblast disease. Rice blast disease is caused by organisms of the genusMagnaporthe (also called Pvricularia). One such organism is Magnaporthegrisea. However, the present invention is not limited to protectionagainst these particular infective organisms. The inventive genes andproteins may provide protection against infection (i.e., resistance toinfection) against other infective organisms.

[0117] Aside from rice, the Magnaporthe grisea fungus can also attackmore than fifty other species of grasses and sedges. The presentinvention can be practiced using any of the species of plants that theorganisms causing rice blast disease can infect.

[0118] The effects of rice blast disease on plants and theidentification of the disease in plants are well known in the art andare described in various publications such as MP 645, Rice Blast:Identification and Control and MP 646, Rice Sheath Blight Control. Thesepublications are incorporated herein by reference and are available fromthe Delta Center, Missouri Agricultural Experiment Station, P.O. Box160, Portageville, Mo. 63873, or are be available a on the World WideWeb at (aes.missouri.edu/delta).

[0119] Preparation of the Transgenic Plant

[0120] The genes of the present invention are introduced into andexpressed in the plants that are susceptible to Magnaporthe grisea. Oneor more of the genes, or all of the genes can be introduced into plantsto provide the desired result of resistance to the rice blast diseasefungus. There are a variety of methods by which the genes can beintroduced into plants and transferred between plants. When rice plantsare used, genes can be introduced into the rice using various methods oftransformation. Such plants that contain an introduced gene are referredto as transgenic plants. Such transformation methods include biolisticmethods, Agrobacterium tumefaciens-based methods, and methods involvingdirect gene transfer into protoplasts. Many of these methods are eitherdescribed or referenced in a paper that is attached to and is part ofthis application. This paper is authored by Yin and Wang and waspublished in Theor Appl Genet in 2000. This paper is incorporated intothe present application by reference. An article authored by V. L. Munizde Padua et. al. published in Plant Molecular Biology Reporter in 2001,describes additional methods for transforming plant cells.

[0121] Once genes have been introduced into rice plants, the genes canbe moved from plant to plant using various genetic methods, traditionalbreeding methods, that are well known to those skilled in the art. Suchmethods include crosses or genetic crosses between the plants.Techniques such as embryo rescue can also be used.

[0122] Methods for DNA transformation of plant cells includeAgrobacterium-mediated plant transformation, protoplast transformation,gene transfer into pollen, injection into reproductive organs, injectioninto immature embryos and particle bombardment. Each of these methodshas distinct advantages and disadvantages. Thus, one particular methodof introducing genes into a particular plant strain may not necessarilybe the most effective for another plant strain, but it is well knownwhich methods are useful for a particular plant strain.

[0123] There are many methods for introducing transforming DNA segmentsinto cells, but not all are suitable for delivering DNA to plant cells.Suitable methods are believed to include virtually any method by whichDNA can be introduced into a cell, such as infection by A. tamefaciensand related Agrobacterium, direct delivery of DNA such as, for example,by PEG-mediated transformation of protoplasts (Ominilleh et al., 1993),by desiccation/inhibition-mediated DNA uptake, by electroporation, byagitation with silicon carbide fibers, by acceleration of DNA coatedparticles, etc. In certain embodiments, acceleration methods arepreferred and include, for example, microprojectile bombardment and thelike.

[0124] Technology for introduction of DNA into cells is well-known tothose of skill in the art. Four general methods for delivering a geneinto cells have been described: (1) chemical methods (Graham and van derEb, 1973); (2) physical methods such as microinjection (Capecchi, 1980),electroporation (Wong and Neumann, 1982; Fromm et al., 1985) and thegene gun (Johnston and Tang, 1994; Fynan et al., 1993); (3) viralvectors (Clapp, 1993; Lu et al., 1993; Eglitis and Anderson, 1988a;1988b); and (4) receptor-mediated mechanisms (Curiel et al., 1991; 1992;Wagner et al., 1992).

[0125] Electroporation

[0126] The application of brief, high-voltage electric pulses to avariety of animal and plant cells leads to the formation ofnanometer-sized pores in the plasma membrane. DNA is taken directly intothe cell cytoplasm either through these pores or as a consequence of theredistribution of membrane components that accompanies closure of thepores. Electroporation can be extremely efficient and can be used bothfor transient expression of clones genes and for establishment of celllines that carry integrated copies of the gene of interest.Electroporation, in contrast to calcium phosphate-mediated transfectionand protoplast fusion, frequently gives rise to cell lines that carryone, or at most a few, integrated copies of the foreign DNA.

[0127] The introduction of DNA by means of electroporation, iswell-known to those of skill in the art. In this method, certain cellwall-degrading enzymes, such as pectin-degrading enzymes, are employedto render the target recipient cells more susceptible to transformationby electroporation than untreated cells. Alternatively, recipient cellsare made more susceptible to transformation, by mechanical wounding. Toeffect transformation by electroporation one may employ either friabletissues such as a suspension culture of cells, or embryogenic callus, oralternatively, one may transform immature embryos or other organizedtissues directly. One would partially degrade the cell walls of thechosen cells by exposing them to pectin-degrading enzymes (pectolyases)or mechanically wounding in a controlled manner. Such cells would thenbe recipient to DNA transfer by electroporation, which may be carriedout at this stage, and transformed cells then identified by a suitableselection or screening protocol dependent on the nature of the newlyincorporated DNA.

[0128] Microprojectile Bombardment

[0129] A further advantageous method for delivering transforming DNAsegments to plant cells is microprojectile bombardment. In this method,particles may be coated with nucleic acids and delivered into cells by apropelling force. Exemplary particles include those comprised oftungsten, gold, platinum, and the like.

[0130] An advantage of microprojectile bombardment, in addition to itbeing an effective means of reproducibly stably transforming monocots,is that neither the isolation of protoplasts (Cristou et al., 1988) northe susceptibility to Agrobacterium infection is required. Anillustrative embodiment of a method for delivering DNA into maize cellsby acceleration is a Biolistics Particle Delivery System, which can beused to propel particles coated with DNA or cells through a screen, suchas a stainless steel or Nytex screen, onto a filter surface covered withcorn cells cultured in suspension. The screen disperses the particles sothat they are not delivered to the recipient cells in large aggregates.It is believed that a screen intervening between the projectileapparatus and the cells to be bombarded reduces the size of projectilesaggregate and may contribute to a higher frequency of transformation byreducing damage inflicted on the recipient cells by projectiles that aretoo large.

[0131] For the bombardment, cells in suspension are preferablyconcentrated on filters or solid culture medium. Alternatively, immatureembryos or other target cells may be arranged on solid culture medium.The cells to be bombarded are positioned at an appropriate distancebelow the macroprojectile stopping plate. If desired, one or morescreens are also positioned between the acceleration device and thecells to be bombarded. Through the use of techniques set forth hereinone may obtain up to 1000 or more foci of cells transiently expressing amarker gene. The number of cells in a focus which express the exogenousgene product 48 hours post-bombardment often range from 1 to 10 andaverage 1 to 3.

[0132] In bombardment transformation, one may optimize theprebombardment culturing conditions and the bombardment parameters toyield the maximum numbers of stable transformants. Both the physical andbiological parameters for bombardment are important in this technology.Physical factors are those that involve manipulating theDNA/microprojectile precipitate or those that affect the flight andvelocity of either the macro- or microprojectiles. Biological factorsinclude all steps involved in manipulation of cells before andimmediately after bombardment, the osmotic adjustment of target cells tohelp alleviate the trauma associated with bombardment, and also thenature of the transforming DNA, such as linearized DNA or intactsupercoiled plasmids. It is believed that pre-bombardment manipulationsare especially important for successful transformation of immatureembryos.

[0133] Accordingly, it is contemplated that one may wish to adjustvarious of the bombardment parameters in small scale studies to fullyoptimize the conditions. One may particularly wish to adjust physicalparameters such as gap distance, flight distance, tissue distance, andhelium pressure. One may also minimize the trauma reduction factors(TRFs) by moditying conditions which influence the physiological stateof the recipient cells and which may therefore influence transformationand integration efficiencies. For example, the osmotic state, tissuehydration and the subculture stage or cell cycle of the recipient cellsmay be adjusted for optimum transformation. The execution of otherroutine adjustments will be known to those of skill in the art in lightof the present disclosure.

[0134] The methods of particle-mediated transformation is well-known tothose of skill in the art. U.S. Pat. No. 5,015,580 (specificallyincorporated herein by reference) describes the transformation ofsoybeans using such a technique.

[0135] Agrobacterium-Mediated Transfer

[0136] Agrobacterium-mediated transfer is a widely applicable system forintroducing genes into plant cells because the DNA can be introducedinto whole plant tissues, thereb bypassing the need for regeneration ofan intact plant from a protoplast. The use of Agrobactenum-mediatedplant integrating vectors to introduce DNA into plant cells is wellknown in the art. See, for example, the methods described (Fraley etal., 1985; Rogers et al., 1987). The genetic engineering of cottonplants using Agrobacterium-mediated transfer is described in U.S. Pat.No. 5,004,863 (specifically incorporated herein by reference), while thetransformation of lettuce plants is described in U.S. Pat. No. 5,349,124(specifically incorporated herein by reference). Further, theintegration of the Ti-DNA is a relatively precise process resulting infew rearrangements. The region of DNA to be transferred is defined bythe border sequences, and intervening DNA is usually inserted into theplant genome as described (Spielmann et al., 1986; Jorgensen et al.,1987).

[0137] Modern Agrobacterium transformation vectors are capable ofreplication in E. coli as well as Agrobacterium, allowing for convenientmanipulations as described (Klee et al., 1985). Moreover, recenttechnological advances in vectors for Agrobacterium-mediated genetransfer have improved the arrangement of genes and restriction sites inthe vectors to facilitate construction of vectors capable of expressingvarious polypeptide coding genes. The vectors described (Rogers et al.,1987), have convenient multi-linker regions flanked by a promoter and apolyadenylation site for direct expression of inserted polypeptidecoding genes and are suitable for present purposes. In addition,Agrobacterium containing both armed and disarmed Ti genes can be usedfor the transformations. In those plant strains whereAgrobacterium-mediated transformation is efficient, it is the method ofchoice because of the facile and defined nature of the gene transfer.

[0138] Agrobacterium-mediated transformation of leaf disks and othertissues such as cotyledons and hypocotyls appears to be limited toplants that Agrobacterium naturally infects. Agrobacterium-mediatedtransformation is most efficient in dicotyledonous plants. Few monocotsappear to be natural hosts for Agrobacterium, although transgenic plantshave been produced in asparagus using Agrobacterium vectors as described(Bytebier et al., 1987). Therefore, commercially important cereal grainssuch as rice, corns and wheat must usually be transformed usingalternative methods. However, as mentioned above, the transformation ofasparagus using Agrobacterium can also be achieved (see, e.g., Bytebieret al., 1987).

[0139] A transgenic plant formed using Agrobacterium transformationmethods typically contains a single gene on one chromosome. Suchtransgenic plants can be referred to as being heterozygous for the addedgene. However, inasmuch as use of the word “heterozygous” usuallyimplies the presence of a complementary gene at the same locus of thesecond chromosome of a pair of chromosomes, and there is no such gene ina plant containing one added gene as here, it is believed that a moreaccurate name for such a plant is an independent segregant, because theadded, exogenous gene segregates independently during mitosis andmeiosis.

[0140] More preferred is a transgenic plant that is homozygous for theadded structural gene; i.e., a trarsgenic plant that contains two addedgenes, one gene at the same locus on each chromosome of a chromosomepair. A homozygous transgenic plant can be obtained by sexually mating(selfing) an independent segregant trarsgenic plant that contains asingle added gene, germinating some of the seed produced and analyzingthe resulting plants produced for enhanced carboxylase activity relativeto a control (native, non-transgenic) or an independent segreganttmnsgenic plant.

[0141] It is to be understood that two different tmnsgenic plants canalso be mated to produce offspring that contain two independentlysegregating added, exogenous genes. Selfing of appropriate progeny canproduce plants that are homozygous for both added, exogenous genes thatencode a polypeptide of interest. Back-crossing to a parental plant andout-crossing with a non-tnansgenic plant are also contemplated.

[0142] Transformation of plant protoplasts can be achieved using methodsbased on calcium phosphate precipitation, polyethylene glycol treatment,electroporation, and combinations of these treatments (see, e.g.,Potrykus et al., 1985; Lorz et al., 1985; Fromm et al., 1985; Uchimiyaet al., 1986; Callis et al., 1987; Marcotte et al., 1988).

[0143] Application of these systems to different plant strains dependsupon the ability to regenerate that particular plant strain fromprotoplasts. Illustrative methods for the regeneration of cereals fromprotoplasts are described (see, e.g., Fujimura et al., 1985; Toriyama etal, 1986; Yamada et al., 1986; Abdullah et al., 1986).

[0144] To transform plant strains that cannot be successfullyregenerated from protoplasts, other ways to introduce DNA into intactcells or tissues can be utilized. For example, regeneration of cerealsfrom immature embryos or explants can be effected as described (Vasil,1988). In addition, “particle gun” or high-velocity microprojectiletechnology can be utilized (Vasil, 1992).

[0145] Using that latter technology, DNA is carried through the cellwall and into the cytoplasm on the surface of small metal particles asdescribed (Klein et al., 1987; Klein et al., 1988; McCabe et al., 1988).The metal particles penetrate through several layers of cells and thusallow the transfomation of cells within tissue explants.

[0146] Methods for DNA transformation of plant cells includeAgrobacterium-mediated plant transformation, protoplast transformation,gene transfer into pollen, injection into reproductive organs, injectioninto immature embryos and particle bombardment. Each of these methodshas distinct advantages and disadvantages. Thus, one particular methodof introducing genes into a particular plant strain may not necessarilybe the most effective for another plant strain, but it is well knownwhich methods are useful for a particular plant strain.

[0147] There are many methods for introducing transforming DNA segmentsinto cells, but not all are suitable for delivering DNA to plant cells.Suitable methods are believed to include virtually any method by whichDNA can be introduced into a cell, such as infection by A. tamefaciensand related Agrobacterium, direct delivery of DNA such as, for example,by PEG-mediated transformation of protoplasts (Ominilleh et al., 1993),by desiccation/inhibition-mediated DNA uptake, by electroporation, byagitation with silicon carbide fibers, by acceleration of DNA coatedparticles, etc. In certain embodiments, acceleration methods arepreferred and include, for example, microprojectile bombardment and thelike.

[0148] Method of Identifying Plants Comprising an NBS Rice BlastResistant Allele

[0149] The present invention also comprises methods of identifyingtransgenic or non-transgenic plants that comprise an NBS rice blastresistant allele. The methods comprise isolating DNA or RNA from a cellof the plant and assaying for the presence of such allele using theprimers or probes of the present invention.

[0150] In one aspect the method is a polymerase chain reaction whichemploys two primers that amplify the entire coding region of the NBSgene or a unique fragment within one or more of the NBS genes. Oneprimer is located at each end of the region to be amplified. Suchprimers will normally be between 10 to 30 nucleotides in length and havea preferred length from between 18 to 22 nucleotides. PCR primers can beselected to amplify the entire sequence set forth in SEQ ID NOs. 85, 87,89, 91 or 93., in which case primers are located at the 5′ and 3′ endsof the illustrated sequences. PCR primers can also be selected toamplify only a part of the sequences set forth in SEQ ID NOs. 85, 87,89, 91 or 93, in which case at least one of the two primers is locatedinternal to the 5′ and 3′ ends of the illustrated sequences. Thesmallest such sequence that can be amplified is approximately 50nucleotides in length (e.g., a forward and reverse primer, both of 20nucleotides in length, whose location in the sequences in SEQ ID NOs.set forth in SEQ ID NOs. 85, 87, 89, 91 or 93 is separated by at least10 nucleotides). Any sequence of approximately 50 nucleotides in lengththat is within the sequences set forth in SEQ ID NOs. 85, 87, 89, 91 or93 is within the scope of this application.

[0151] One primer is called the “forward primer” and is located at theleft end of the region to be amplified. The forward primer is identicalin sequence to the strand of the DNA set forth in SEQ ID NOs. 85, 87,89, 91 or 93. The forward primer hybridizes to the strand of the DNAwhich is complementary to the strand of the DNA set forth in SEQ ID NOs.85, 87, 89, 91 or 93. With reference to the sequences as oriented in setforth in SEQ ID NOs. 85, 87, 89, 91 or 93, the forward primer primessynthesis of DNA in a leftward to rightward direction.

[0152] The other primer is called the “reverse primer” and is located atthe right end of the region to be amplified. The reverse primer iscomplementary in sequence to the strand of the DNA set forth in SEQ IDNOs. 85, 87, 89, 91 or 93. The reverse primer hybridizes to the strandof the DNA set forth in SEQ ID NOs. 85, 87, 89, 91 or 93. With referenceto the sequences as oriented in set forth in SEQ ID NOs. 85, 87, 89, 91or 93, the reverse primer primes synthesis of DNA in a rightward toleftward direction.

[0153] Preferably, the primers that are chosen to amplify a sequencewithin SEQ NOs. 85, 87, 89, 91 or 93 are between 15 to 30 nucleotides inlength, more preferably 18 to 25 in length, most preferably between 18to 22 nucleotides in length.

[0154] PCR primers should also be chosen subject to a number of otherconditions. PCR primers should be long enough (preferably 15 to 18nucleotides in length) to minimize hybridization to greater than oneregion in the genomic template DNA. Primers with long runs of a singlebase should be avoided, if possible. Primers should preferably have apercent G+C content of between 40 and 60%. If possible, the percent G+Ccontent of the 3′ end of the primer should be higher than the percentG+C content of the 5′ end of the primer. Primers should not containsequences that can hybridize to another sequence within the primer(i.e., palindromes). Two primers used in the same PCR reaction shouldnot be able to hybridize to one another. Although PCR primers arepreferably chosen subject to the recommendations above, it is notnecessary that the primers conform to these conditions. Other primersmay work, but have a lower chance of yielding good results.

[0155] PCR primers that can be used to amplify DNA within a givensequence are preferably chosen using one of a number of computerprograms that are available. Such programs choose primers that areoptimum for amplification of a given sequence (i.e., such programschoose primers subject to the conditions stated above, plus otherconditions that may maximize the functionality of PCR primers). Onecomputer program is the Genetics Computer Group (GCG recently becameAccelrys) analysis package which has a routine for selection of PCRprimers. There are also several web sites that can be used to selectoptimal PCR primers to amplify an input sequence. One such web site ishttp://alces.med.umn.edu/rawprimer.html. Another such web site ishttp://www-genome.wi.mit.edu/cgi-bin/primer/primer3_www.cgi.

[0156] Once PCR primers are chosen, they are used in a PCR reaction. Astandard PCR reaction contains a buffer containing 10 mM Tris-HCl (pH8.3), 50 mM KCl, and 6.0 mM MgCl₂, 200 uM each of dATP, dCTP, dTTP anddGTP, two primers of concentration 0.5 uM each, 7.5 ng/ul concentrationof template DNA and 2.5 units of Taq DNA Polymerase enzyme. Variationsof these conditions can be used and are well known to those skilled inthe art.

[0157] The PCR reaction is performed under high stringency conditions.Herein, “high stringency PCR conditions” refers to conditions that donot allow base-pairing mismatches to occur during hybridization ofprimer to template. Such conditions are equivalent to or comparable todenaturation for 1 minute at 95° C. in a solution comprising 10 mMTris-HCl (pH 8.3), 50 mM KCl, and 6.0 mM MgCl₂, followed by annealing inthe same solution at about 62° C. for 5 seconds.

[0158] Successful amplification of the template DNA to produce a PCRproduct of the correct size (i.e., a size equivalent to the length ofthe two primers plus the length of DNA between the two primers as setforth in SEQ ID NOS. 85, 87, 89, 91, and 93 is determinative of whetherthe genome of the plant that is tested contains the targeted NBS riceblast resistant allele. Absence of a PCR product of the correct sizeindicates that the genome does not contain the targeted NBS rice blastreistant allele.

REFERENCES

[0159] Amante-Bordeos, A., Sitch, L. A., Nelson, R. Damacio, R. D.Oliva, N. P., Aswidinnoor, H., and Leung H. 1992. Transfer of bacterialblight and blast resistance from the tetraploid wild rice Oryza minutato cultivated rice, Oryza sativa. Theor. Appl. Genet. 84:345-354.

[0160] Baker, B., Zambryski, P., Stakawicz, B., and Dinesh-Kumar S. P.1997. Signaling in plant-microbe interactions. Science 276:726-732.

[0161] Bent, A. F. 1996. Plant disease resistance genes: function meetsstructure. Plant Cell 8:1757-1771.

[0162] Bonman, J. M., Vergel De Dios, T. I., and Khin, M. M. 1986.Physiologic specialization of Pyricularia oryzae in the Philippines.Plant Disease 70:767-769.

[0163] Bonman, J. M., Khush, G. S., Nelson, R. 1992. Breeding rice forresistance to pests. Annu. Rev. Phytopathol. 30:507-528.

[0164] Botella, M. A., Parker, J. E., Frost, L. N., Bittner-Eddy, P. D.,Beynon, J. L., Daniels, M. J., Holub, E. B., Jones, J. D. 1998. Threegenes of the Arabidopsis RPPI complex resistance locus recognizedistinct Peronospora parasitica avirulence determinants. Plant Cell 10:1847-60.

[0165] Bryan, G. T., Wu, K. S., Farrall, L., Jia, Y., Hershey, H. P.,McAdams, S. A., Faulk, K. N., Donaldson, G. K., Tarchini, R., Valent, B.2000. A single amino acid difference distinguishes resistant andsusceptible alleles of the rice blast resistance gene Pi-ta. Plant Cell.12:2033-46.

[0166] Chen, D., Zeigler, R. S., Ahn, S. W., Nelson, R. J. 1996.Phenotypic characterization of the rice blast resistance gene Pi-2 (t).Rice. Plant Dis 80:52-56.

[0167] Chen, C., Huang, S., Ling, P., Yu, C., Deng, Z., Gmitter, F. G.1999. A Novel method to clone BAC insert ends based on double digestion.p17, Abstract presented at the Plant and Animal Genome VII, San Diego,Calif. Jan 17-21, 1999.

[0168] Dellaporta, S. L., Wood, J., Hicks, J. B. 1984. Maize DNAminiprep. In: Molecular Biology of Plants. A Laboratory Course Manual,M. Russell, Eds. Cold Spring Harbor Laboratory, Cold Spring Harbor,N.Y., pp 36-37

[0169] Huang, N., McCouch, S., Mew, T., Parco, A., Guiderdoni, E. 1994.Development of an RFLP map from a doubled haploid population in rice.Rice Genetics Newsletter 11:134-137.

[0170] Inukai, T., Mackill, D. J., Bonman, J. M., Sarkarung, S.,Zeigler, R., Nelson, R., Takamure, I., and Kinoshita, T. 1992. Blastresistance gene Pi2(t) and Pi-Z may be allelic. Rice Genetics Newsletter9:90-92.

[0171] Jia, Y., McAdams, S. A., Bryan, G. T., Hershey, H. P., andValent, B. 2000. Direct interaction of resistance gene and avirulencegene products confers rice blast resistance. EMBO J. 19:4004-14.

[0172] Mackill, D. J., and Bonman, J. M. 1992. Inheritance of blastresistance in near-isogenic lines of rice. Phytopathology 82:746-749.

[0173] McCouch, S. R., Nelson, R. J., Tohme, J., and Zeigler, R. S.1994. Mapping of blast resistance genes in rice. In Rice Blast Disease,eds Zeigler, R. S., Leong S. A. and Teng, P. S., CAB International. pp167-186.

[0174] Meyers, B. C., Chin, D. B., Shen, K. A., Sivaramakrishnan, S.,Lavelle, D. O., Zhang, Z., and Michelmore, R. W. 1998. The majorresistance gene cluster in lettuce is highly duplicated and spansseveral megabases. Plant Cell 10:1817-32.

[0175] Michelmore, R. W., and Meyers, B. C. 1998. Clusters of resistancegenes in plants evolve by divergent selection and a birth-and-deathprocess. Genome Res 8:1113-30.

[0176] Michelmore, R. W., Paran, I., and Kesseli, R. V. 1991.Identification of markers linked to disease-resistance genes by bulkedsegregant analysis: a rapid method to detect markers in specific genomicregions by using segregating populations. Proc Natl Acad Sci USA.88:9828-32.

[0177] Moffat, A. S. 1994. Mapping the sequence of disease resistance.Science 265:1804-1805.

[0178] Ou, S. H. 1985. pp 109-201 in Rice Disease, Ed. 2. The CambrianNews Ltd., UK.

[0179] Paran, I., and Michelmore, R. W. 1993. Development of reliablePCR-based markers linked to downy mildew resistance genes in lettuce.Theor Appl Genet 85:985-993.

[0180] Sambrook, J., Fritsch, E. F., and Maniatis, T. 1989. MolecularCloning: A Laboratory Manual, 2nd Ed. Cold Spring Harbor, N.Y.

[0181] Sitch, L. A., Amante, A. D., Dalmacio, R. D. and Leung, H. 1989.Oryza minuta, a source of blast and bacterial blight resistance for riceimprovement. In Review of Advance in Plant Biotechnology, 1985-1988.Eds: A. Mujeeb-Kazi and LA Sitch, International Maize and WheatImprovement Center and International Rice Research Institute.

[0182] Shizuya, H., Birren, B., Kim, U. J., Mancino, V., Slepak, T.,Tachiiri, Y., and Simon, M. 1992. Cloning and stable maintenance of300-kilobase-pair fragments of human DNA in Escherichia coli using anF-factor-based vector. Proc Natl Acad Sci USA 89:8794-7.

[0183] Song, W. Y., Wang, G. L., Chen, L., Zhai, W., Kim, H. K.,Holsten, T., Zhu, L. and Ronald, P. 1995. The rice disease resistancegene, Xa21, encodes a receptor-like protein kinase. Science:270:1804-1806.

[0184] Williams, J. G. K., Kubelik, A. R., Livak, K. J., Rafalski, J.A., and Tingey, S. V. 1990. DNA polymorphisms amplified by arbitraryprimers are useful as genetic markers. Nucleic Acids Res. 18:6531-6535.

[0185] Wang, G. L., Mackill, D. J., Bonman, J. M., McCouch, S. R., andNelson, R. J. 1994. RFLP mapping of genes conferring complete andpartial resistance to blast in a durably resistant rice cultivar.Genetics, 136:1421-1434.

[0186] Wang, G. L., Holsten T. E., Song, W. Y., Wang, H. P., Ronald,P.C. 1995. Construction of a rice bacterial artificial chromosomelibrary and identification of clones linked to the Xa21 diseaseresistance locus. The Plant Journal 7:525-533.

[0187] Wang, Z. X., Yano, M., Yamanouchi, U., Iwamoto, M., Monna, L.,Hayasaka, H., Katayose, Y., Sasaki, T. 1999. The Pib for rice blastresistance belongs to the nucleotide binding and leucine-rich repeatclass of plant disease resistance genes. The Plant J. 19:55-64.

[0188] Wei, F., Gobelman-Werner, K., Morroll, S. M., Kurth, J., Mao, L.,Wing, R., Leister, D., Schulze-Lefert, P., and Wise, R. P. 1999. The Mla(powdery mildew) resistance cluster is associated with three NBS-LRRgene families and suppressed recombination within a 240-kb DNA intervalon chromosome 5S (1HS) of barley. Genetics 153:1929-48.

[0189] Yu, Z. H, Mackill, D. J., and Bonman, J. M., and Tanksley, S. D.1991. Tagging genes for blast resistance in rice via linkage to RFLPmarkers. Theo. Appl. Genet. 81:471-476.

[0190] Yokoo, M., Kikuchi, F., Fujimaki, H. 1978. Breeding of blastresistance lines (BL1-7) from indicajaponica crosses of rice. Jpn JBreed 28:359-385. TABLE 1 Disease reaction of lines containing Pi9, Pi2or neither allele to 43 blast isolates from 14 countries. R standsresistance reaction, MR for medium or partial resistance reaction, and Sfor susceptible reaction. 75-1-127 C101A51 Isolate Country IR31917 (Pi9)CO39 (Pi2) ML8 Mali S R S R ML25 Mali S R S S ML33 Mali S R S R ML53Mali S R S S 95090B(119) China S R S R 96017(138) China S R S RSA5ZB13(58) China S R S R WAN97ZA13 China S R S R 95116AZC13 China S R SR 95033ZB15 China S R S R ZHONG39ZA7 China S R S R (3) 36B23 China S R SR CHE86056ZB13 China S R S R 95097AZC13 China S R S R CHE86061ZE13 ChinaS R S R 47ZB15(67) China S R S R ZHONG79ZC15 China S R S R 87024ZG1China S R S MR 78334ZA27(7) China S R S R ZHONGSZC7 China S R S R CH64China S R S R 97-4-1 China R R S R 97-104-2 China S R S R 97-5-1 China SR S R 97-102-2 China S R S R 97-55-2 China S R S R 54-04 China S R S R10-18-14 China S R S S TH4 Thailand S R S S PO6-6 Philip- S R S R pinesC9240-1 Philip- S R S S pines PH9 Philip- S R S R pines DB24 Burundi S RS S CD101 Ivory S R S R Coast GUY11 France S R S S ES5 Spain S R S R ES6Spain S R S R IT10 Italy S R S R PR3 Portugal MR R S R US30 USA R R S RCL6 Colombia S R S R KJ201 South S R S R Korea KI1117 South S R S RKorea

[0191] TABLE 2 NBS1 NBS2 NBS3 NBS4 NBS5 NBS6 PIB NBS 1 57.75 62.88 49.6962.78 59.06 32.05 66.23 69.63 57.35 69.53 67.11 42.14 NBS2 55.36 45.5955.16 48.86 33.40 63.76 53.26 63.46 58.45 43.87 NBS3 51.72 98.06 62.8832.95 59.49 98.06 72.22 44.77 NBS4 51.67 92.90 28.81 59.46 95.12 38.52NBS5 62.66 32.48 72.00 44.22 NBS6 31.53 41.18

[0192] TABLE 4 Pi9 Primers Name Seq 07.0kb-F1 TCCCAAATCTCAGGTGTCTT7.0kb-R1 ATCAGCAGGCGGCAAACT 7.0kbNBS-F1 GTAGGTACATCAAGGACGAG 7.0kbNBS-R1AGCATATCTTTGAGCATTTG 7.0kbNBS-F2 ACTGTTGTAGCGGAGGAGA 7.0kbNBS-R2TTTCCATTGTRGGCGAGGTG N2AR-F 5′ATGTCAGCCAAAATCAATCA N2AR-R5′GGAAGGGGACAAGGACAATA N2A19N-F 5′CTTTTCGTGGGNGGGGTTA N2A19N-R5′GGTGAGTGATGACAGCAACA B3NlC-F 5′CAGCTTTGAGGACATTCG 3′ B3N1C-R5′TTTTGACCCCAGACGACCAG 3′ B3N2A-F 5′GAGAGGGGATGGACAAAGAG 3′ B3N2A-R5′GGAAAACGAAACGGTAGATT 3′ B3N1CR-R 5′TGCCGCCTGCTCGTCCTC B3N1CR-F5′GGTTGTTTTCCCTTGTCC 75-19-F1 5′AAGGAAGATGAGCCGTGAT 75-19-R15′TGATGCGTGATGATTTTGTA 75-19-F2 TCTGATGTCCTCTGAACTGA 75-19-R2ACTGCCTCTGCTGTTGTTGA 19RF-F GGTCTGGCACTATTTTTACTTT 19RT-RGCAAAAGGAACTGATAAGAT 74RF-F ATCTCATTTTTAGGTTCTGTCG 74RF-RCAAGCAGCCACCACCATCTC 70RF-F CGACCGAACCGCCTTTAG 70RF-RTCAACGAAGAAGAGATGTAG 4RF-F AATACACACCCCAATCATACTG 4RF-RAAAAACAAGACGGCAGAACAGA 19RF-F1 AAGAAGATAATGGAATGGGG 19RF-R1GAGAGATTGCAGAGAAGAGA 19S-16 TTTCCTTCTTGTAACCTGAT 19N-29CTGCTTGCTATTCGTTCATC W8F-1 <5′taacagttctcccaatctcc> W8R1<5′ccggactaagtactggct> V14F-3, 5′GCCATGTTGCTGCGGAAAAT V14R-3,5′AAACTTAGGGCATTCAATCC A114F-2, 5′TGGTGCACTCAGAAAGAAT V14F-4,5′CGCCATGTTGCTGCGGAA V14R-4, 5′AGATCCCGCCACTAAACTTA A114F-3,5′CTTGTTGGTATGAGTATTCT A114R-2, 5′GCAGTGTCATCTTGTCTCC 19LEND-F1TTGAATACAGTGCTAAAGTG 19LEND-R1 GTATGACAATGGATGGAGAA 19LEND-F2TAGCAAGGGATGGGAGCAA 19LEND-R2 TAGGTGCACAGGAAGAGAAT 19LEND-F3TCGGTATTTGTTTGATTGGA 19LEND-R3 CGCGTGATATTCTTGACTGT 19REND-F1ATTAGGCACCCCAGGCTTTA 19REND-R1 TCCTTGTGGCGATTTGTATT 19REND-F2TCCAGGTAAAGAGAAAAGT 19REND-R2 TTGTACTAAATCAGAAGCAC 19REND-F3TGTTGGCATCTTTTATCTGA 19REND-R3 TAGCTGCTATTTGTGATGTA 7.0Prom-FGCACGCTTCTGTAACTCCA 7.0Prom-R ATTAGATTTGGCGATTATGC 7.0-3Prim-FTACACACCCCAATCATACTG 7.0-3Prim-R GCATCAGCAGGCGGCAAACT 74REND-F1ATAGAGTGAGATGGAGAAGA 74REND-R1 TTAAAGCCCGTCACCGATAG 74REND-F2GCACCCCAGGCTTTACACT 74REND-R2 GGGCNTAATTTCAGTTCTTT 74LEND-F1GTACTGGTGTGATTATGGTG 74LEND-R1 AAAAATACAAAGAAAAACC 31REND-F1CAGGCTTTACACTTTATGC 31REND-R1 CGATACGTGTTTTTCTGGAC 31REND-F2GCACATTAACAACAAAGAAA 31REND-R2 ATGTTCAGTTGGAGAGTAGC 7.0kbNBS-F1GTAGGTACATCAAGGACGAG NBS/LRR-R1 AGGTGTTCGCCCCGCAGGT NBS/LRR-F2CACTGTTGTAGCGGAGGAGA NBS/LRR-R2 CAGTACGCGATTTTCATTGTTC Pst.seq.1CGTGCTGATGTGATGCTTCTA 19-7-F1 ttcttcgtctttatcgcaac 3Prime-R1GGTGCAGCGGGGAGGAGGT BeforeH3-R GGTGCCCTCCTCAAATCTTATC 12L-R-FTTGGCGTCTTGAGGAGTCGTAT 12L-R-R GGCGGTGGTGGCGAGGAGTT AfterH3-FGGTGCAGCGGGGAGGAGGT AfterH3-F1 ATGTGCTGCAAGAGGAAGTGAA 195F-1TTGCTCCATCTTCCTCTGTT 195R-1 ATGGTCCTTTACTTTATTG 195F-FATACAGACCAGAGAAAGAAAAA 195F-R ACAGAGGAAGATGGAGCAAAGT

[0193]

1 97 1 20 DNA Oryza minuta 1 tcccaaatct caggtgtctt 20 2 18 DNA Oryzaminuta 2 atcagcaggc ggcaaact 18 3 20 DNA Oryza minuta 3 gtaggtacatcaaggacgag 20 4 20 DNA Oryza minuta 4 agcatatctt tgagcatttg 20 5 19 DNAOryza minuta 5 actgttgtag cggaggaga 19 6 20 DNA Oryza minuta 6tttccattgt rggcgaggtg 20 7 20 DNA Oryza minuta 7 atgtcagcca aaatcaatca20 8 20 DNA Oryza minuta 8 ggaaggggac aaggacaata 20 9 19 DNA Oryzaminuta misc_feature (12)..(12) n can be a, t, c or g. 9 cttttcgtgggnggggtta 19 10 20 DNA Oryza minuta 10 ggtgagtgat gacagcaaca 20 11 18DNA Oryza minuta 11 cagctttgag gacattcg 18 12 20 DNA Oryza minuta 12ttttgacccc agacgaccag 20 13 20 DNA Oryza minuta 13 gagaggggat ggacaaagag20 14 20 DNA Oryza minuta 14 ggaaaacgaa acggtagatt 20 15 18 DNA Oryzaminuta 15 tgccgcctgc tcgtcctc 18 16 18 DNA Oryza minuta 16 ggttgttttcccttgtcc 18 17 19 DNA Oryza minuta 17 aaggaagatg agccgtgat 19 18 20 DNAOryza minuta 18 tgatgcgtga tgattttgta 20 19 20 DNA Oryza minuta 19tctgatgtcc tctgaactga 20 20 20 DNA Oryza minuta 20 actgcctctg ctgttgttga20 21 22 DNA Oryza minuta 21 ggtctggcac tatttttact tt 22 22 20 DNA Oryzaminuta 22 gcaaaaggaa ctgataagat 20 23 22 DNA Oryza minuta 23 atctcatttttaggttctgt cg 22 24 20 DNA Oryza minuta 24 caagcagcca ccaccatctc 20 2518 DNA Oryza minuta 25 cgaccgaacc gcctttag 18 26 20 DNA Oryza minuta 26tcaacgaaga agagatgtag 20 27 22 DNA Oryza minuta 27 aatacacacc ccaatcatactg 22 28 22 DNA Oryza minuta 28 aaaaacaaga cggcagaaca ga 22 29 20 DNAOryza minuta 29 aagaagataa tggaatgggg 20 30 20 DNA Oryza minuta 30gagagattgc agagaagaga 20 31 20 DNA Oryza minuta 31 tttccttctt gtaacctgat20 32 20 DNA Oryza minuta 32 ctgcttgcta ttcgttcatc 20 33 20 DNA Oryzaminuta 33 taacagttct cccaatctcc 20 34 18 DNA Oryza minuta 34 ccggactaagtactggct 18 35 20 DNA Oryza minuta 35 gccatgttgc tgcggaaaat 20 36 20 DNAOryza minuta 36 aaacttaggg cattcaatcc 20 37 19 DNA Oryza minuta 37tggtgcactc agaaagaat 19 38 18 DNA Oryza minuta 38 cgccatgttg ctgcggaa 1839 20 DNA Oryza minuta 39 agatcccgcc actaaactta 20 40 20 DNA Oryzaminuta 40 cttgttggta tgagtattct 20 41 19 DNA Oryza minuta 41 gcagtgtcatcttgtctcc 19 42 20 DNA Oryza minuta 42 ttgaatacag tgctaaagtg 20 43 20DNA Oryza minuta 43 gtatgacaat ggatggagaa 20 44 19 DNA Oryza minuta 44tagcaaggga tgggagcaa 19 45 20 DNA Oryza minuta 45 taggtgcaca ggaagagaat20 46 20 DNA Oryza minuta 46 tcggtatttg tttgattgga 20 47 20 DNA Oryzaminuta 47 cgcgtgatat tcttgactgt 20 48 20 DNA Oryza minuta 48 attaggcaccccaggcttta 20 49 20 DNA Oryza minuta 49 tccttgtggc gatttgtatt 20 50 19DNA Oryza minuta 50 tccaggtaaa gagaaaagt 19 51 20 DNA Oryza minuta 51ttgtactaaa tcagaagcac 20 52 20 DNA Oryza minuta 52 tgttggcatc ttttatctga20 53 20 DNA Oryza minuta 53 tagctgctat ttgtgatgta 20 54 19 DNA Oryzaminuta 54 gcacgcttct gtaactcca 19 55 20 DNA Oryza minuta 55 attagatttggcgattatgc 20 56 20 DNA Oryza minuta 56 tacacacccc aatcatactg 20 57 20DNA Oryza minuta 57 gcatcagcag gcggcaaact 20 58 20 DNA Oryza minuta 58atagagtgag atggagaaga 20 59 20 DNA Oryza minuta 59 ttaaagcccg tcaccgatag20 60 19 DNA Oryza minuta 60 gcaccccagg ctttacact 19 61 20 DNA Oryzaminuta misc_feature (5)..(5) n can be a, g, t, or c. 61 gggcntaatttcagttcttt 20 62 20 DNA Oryza minuta 62 gtactggtgt gattatggtg 20 63 19DNA Oryza minuta 63 aaaaatacaa agaaaaacc 19 64 19 DNA Oryza minuta 64caggctttac actttatgc 19 65 20 DNA Oryza minuta 65 cgatacgtgt ttttctggac20 66 20 DNA Oryza minuta 66 gcacattaac aacaaagaaa 20 67 20 DNA Oryzaminuta 67 atgttcagtt ggagagtagc 20 68 19 DNA Oryza minuta 68 aggtgttcgccccgcaggt 19 69 20 DNA Oryza minuta 69 cactgttgta gcggaggaga 20 70 22DNA Oryza minuta 70 cagtacgcga ttttcattgt tc 22 71 21 DNA Oryza minuta71 cgtgctgatg tgatgcttct a 21 72 20 DNA Oryza minuta 72 ttcttcgtctttatcgcaac 20 73 19 DNA Oryza minuta 73 ggtgcagcgg ggaggaggt 19 74 22DNA Oryza minuta 74 ggtgccctcc tcaaatctta tc 22 75 22 DNA Oryza minuta75 ttggcgtctt gaggagtcgt at 22 76 20 DNA Oryza minuta 76 ggcggtggtggcgaggagtt 20 77 19 DNA Oryza minuta 77 ggtgcagcgg ggaggaggt 19 78 22DNA Oryza minuta 78 atgtgctgca agaggaagtg aa 22 79 20 DNA Oryza minuta79 ttgctccatc ttcctctgtt 20 80 20 DNA Oryza minuta 80 atggtcctttatctttattg 20 81 22 DNA Oryza minuta 81 atacagacca gagaaagaaa aa 22 8222 DNA Oryza minuta 82 acagaggaag atggagcaaa gt 22 83 76272 DNA Oryzaminuta 83 aagcttcggg cagctcccgg tggcgcggcg atggctgcag gctcccggctgcacgacgac 60 aactcccgcg cccgcgtcga ccgctgcagg cgccaggtag cgcgacgctcctggcggcgt 120 gtcgatagag ggggcgggac agtagcggcg cgcgcgagag gaggcgacggcgcgcaaggt 180 ggctggcggt ggtggagtgg tggtggacag cggtgcgccg tgcgcctggtggtggtggtt 240 gtgtgggcga ccgcgtgtgg ctgcagcgga gagccactgg aggagaggcgctcgactcga 300 tccggaggag tttgaccaaa gactagggtt agcttttata tatacctgttaattgggcat 360 tagtattatg ggccaaaacg tctaaaaagt ctagatttag tcatctatacagtaaagtcg 420 ggtcaccgcg ggggcgggga cggtgaatgg tgcaccgtcc ccacccccgtcatctccgat 480 ggtgctaact ttggaaccat tttattcacc attgtggata gatattaaccatcccagtca 540 cctaaaaggt gaattcaccg tgagaaaacg gtgaacgggg ccccattgccatctcttctt 600 tatatgtaca tgaaagctag ggttggagca tattgctcat gcctaaatccaaagatacaa 660 agcctatcca aaatggctcc atattcctct gcattctcca tccatctctccaaatttagc 720 tccatatttt tttaatgacc catgactctg ttttcaccta aattggctaaatacttgtct 780 tacttttagt tatttatgaa tacaatatgt taattactct taattattatttttataaat 840 tagttgctct taatgtatat atatatccaa ctactataag atagggagtgagtttggatc 900 ccaagattta aaaaggaaga gggggggggg gatggacaca gagtaggataagatagggag 960 tgggttaggg atatggatag agggggattt ttgggtgttt ctcttaaagagaaaagtcta 1020 tatttagtta tgcaactatg gctttggtat gtgtctaagg ctctgttcatttctggaagt 1080 tcccaacctc cacctcccat ttcccgccgc atgcttttta aactgttaaacgatgtattt 1140 ttaaaaaata tttaggaaag ttgttttaaa aaaatcatat taattcataagttttttatc 1200 taatacttac ttaacaatgc gttaattacc gttttgtttt ccgtgcgtggaggattagtt 1260 ccaacccacc ttgaagaaca tagcctaagg cggagtttgt ttcagcttttaggctgaaat 1320 tttgtgatgc tacgtgaaat tttgacgaga taagtcatta gcacgtgactaatcgaatct 1380 taattattac aaaattaaaa atatatattt atacataata attttgttaaatgaaacgta 1440 ccgtttagcg gttgggatgc gtgctcatag acaaccatga agtagctattccaaaacatt 1500 gcttagaacg caccctaaaa aagaccatgg ctttggtcca cttttgatttctaaatcatg 1560 aactcattta cttttggtcc atgaactctc aaaaccgttc gcatttcaccccataagttc 1620 aaaccaggat tgttttcaat gatgtaacgt caacatggag ttgaccagggtgatgatgat 1680 aaaataaaca atagcaaaat aaagaaaaaa aatactgtga cctacatatgtcaaacttgc 1740 ttcctttcct tctctctcca taatcctata tatgttattt actttaaattgctacctaaa 1800 atgacacatg atgccatgtc cttaaaaacc ctaccgtgat tatgtcaaggtttgaaatat 1860 gaggcggtta taagagttga ggggctaaca gttacagggt ttcatagttaagtgaacttt 1920 ttttcttctt aaaaaacata acagagagtg caaaacagaa taggaagtgtgatatagagg 1980 tggattgtat tgcctctact aaacgaatct ggaattttac attgtagaccttttaaaaac 2040 ttatgttaca aatagatggc tagaaaaatt tatttctaga atgagccctttttgcggcac 2100 caacgccatt attactatgg tccaacatga tggcaccaat gacattattgccatctccct 2160 tgccccttaa atggaagcta agggctagtt tggttcgtgg ccttaattgaccttgccaat 2220 atttgtcaat ttcaatagtg tttagtgtct atttggttat aagccaaattttgacatgcc 2280 taaagaaata gaccatttca atagtgaaat taggttgttt tgcttcaatccaaacacaac 2340 tttatcttac caagattagc catgccaaaa cttaccaaaa tttgacattgacaaaaattg 2400 gtaaggtcaa tttaggtcac aaagcaaaac agccctaagt ctcggatgtgaaggggttac 2460 gttcagaacc tatttgtaaa aaaattccaa aatactaaaa tataaattatgtttcttaca 2520 tataggtgtt taatgttaaa tgtaaccgag gacctattta taattttttttataaactat 2580 tcattctttt gaatttcttt tataaataag ttctcgacgc aagtgttattactgttgacg 2640 cccttcatat tagtgattta gcttttacgg tgaagagata caaggacaattatgctagcc 2700 ccgcgaaaat gatccatttc tagaataagt tttttcaaaa atttatttgtaaaataattt 2760 ttcgaaaata acaaaagtaa aagctgctaa acgaagagtt cctgcatttcaaacaataat 2820 aaactattag gaaaaacacg tgatccggtg acaacactag tttactcctctgtgcacgtg 2880 ggtccaggca ccggttaccg tttggtgttg tcctgttccg gttcctcggtgtccagtagt 2940 tccactgggt ggttggatgg tccgctgagc gcttggggcc cacgcgtagccttgttctct 3000 ggataagaaa actgcacttc catttctcgt gaaaatgtct actggtagaacagagtatgg 3060 gccggcccac taacttagcc taagtaacat acctcaatgg gctctcatagcccaataccc 3120 cactgctgat ttttttctta gggcatgttt aaattgctgg caaaataaacctttcgaaaa 3180 aaaaaatatt aagtcgataa cattatcaag ttttgatagg gtaagtttttttttgggatg 3240 tgtttaattt ggtgcccaca tataggggtg gaaaaaaaaa gctcgaagctcgcgactcga 3300 attgtctcgt ttcaggctca gcttggatcg actcgagttc ggagtcttaacaagtcgagc 3360 cgagacagct gttttgctcg ttaagctaac gagctgatcc cgagccagctcgcgagtcac 3420 tttgtaggct cgttaaactc tcatagccca agaatcctac aggccacagcccaatcaggt 3480 ggccgaaccc agtaaaaact ccctccccga cttcctgttt tctctaaaccctatcctatc 3540 ttccgcctcc accgaatcgc aggctcgcag gcggcgactt gcaaccaccaccaacaccaa 3600 gcaggctgtt gtcgtcgcct tctccttgtg ccgtcggagc gcgcccaccggctgctcctc 3660 ccgcagccgc agggccacgc tgacgacgcc gctcctcccg cagccgtagcaccatgccga 3720 ccacgccact cctcccacag ctgcaaggcc actcgctacc tcctagtccctggaggcttg 3780 aaccctgctg gtatgcttca tggatcagga gcgccggtta gcaagcatggaatccaagaa 3840 ccgtagcaag gggaagaaat tgtcaggagc ttctccctct agtttgactccagccggcag 3900 ccacctcgcc tctgttcaga agccaccaag acagagacct aaagcaagtccaccaacctc 3960 acatcaagca agtgtcgcat caatattttt cacaatttgt agcaattttttttcctaatt 4020 ttgtcactga ttgaaccatt caacttgtga tgcacaatct gtagtaagtgttctgtaggt 4080 ttgttttctt attctgtggt gcactgatga tggacatagt tatatattgaattggattat 4140 acaaggaccc aatggagggt tcattcagtt aacgagctaa acaagctcaatgagccagct 4200 cgagcaagct cgctgagttg agtcgagtca gcatttcagc tcgttaagcctaatgagccg 4260 agtcgagccg agatggcttt ttaaggcagc gagccaaacc gagcgagtcgagctgcctcg 4320 atatccaccc ctacccacat ataggccatc aaagtttggc aacatttgaaggttatgata 4380 ttttattgtg acgttgctaa attcctgtga gcattaccaa cgtttgtttacaaactaaat 4440 gtattaatat atcatttttc taaaaaaatg ttacgatttt aaatgtcatcaatatgaata 4500 aagcccttaa tctctcatcc tactggctgc cgtggcacct aatagcacatggcatgtcca 4560 cgttggcaac gagcgcgcca cgtaagtatg tgatgctgcg aaaaggacaaggccagacag 4620 agtcacagag ctagctgcgt ccagctaagc ctcgtgcacg taccgattccaatgccttct 4680 cctggaggta agtatttttg ttccacttga ttatatttct ttggtagatgacgtggacgc 4740 tgccacctgg atgctctgga atccagcgaa gtcagcagtc aaattaacaggtattttaca 4800 attttttttt ctgaattatt gtggtcgtgt tgcatcggta agagtaacaccaagcttaac 4860 tttccttgat cgatgctgct tttactcaaa cctggaagag atagtaatattttaattaat 4920 cagtgagaaa agaaaaactg atgtgaacgg gacaatttgc acgatgggggaataataata 4980 ttatttgggt tgggttttgt taaatactcc tacttgcttt gtcttttttctcgaccacat 5040 gaccaaatcg agctgtcaca actcgcacgt ccataaatta agaatattatgctaatgcag 5100 ttgacattcg acggcaacta gcaagacgga gtagctcacc aaatactattgctatatatc 5160 taattcaacc agcttggata aatggtgagc tgaatgcttg ctcaaacactcagactcacc 5220 aaccatatta tcatgtggct gtctatccag attgtcaaga ttttggtcatgtgatatatg 5280 atatctctat aatgcatgca tgccttttta atgaatagtt caatttcttacttttgcagg 5340 ccaatttgtt gtttgtgagt ggtcaatcta aaattaaatc taagataactttgataggat 5400 ggatgagtag attttttttt aactaaaatg tgaaattgat gtcaaatttgcatgataagt 5460 tgccgttcca acgaatagca ctagtaaaac acaatatttc gtgaatgcatgatgtttgaa 5520 aatgatgaag gtaaagatta agttgacgca cgtaaaacaa gaaagtcattagtgtataat 5580 taattaaatt ttaattatta taaatttgat aaatggatat atatgatattttaaagtaag 5640 ttctataata taaagttttt gcacggaatg tacatatagt agttttgataagtgctaatg 5700 ataaccgagc taaactctat aatcctaatc atagaacagt tcgtttagtgtgcatgagta 5760 gagtaggaac tcatgtgcta ttcagccagt ttagtcctac cacgttcgtttggagaacaa 5820 aggagtgagt ttgtttcgtt ttccgcgcgt acgcttcccg agttattgaacggtgcgttt 5880 tttgcaaaaa ttttctatag gaaaattact ttaaaaaatc atattaatctatttttgaag 5940 tttaaaatag ttaatactca attaatcatg cgctaatagc tcacatcgttttacgtatct 6000 tcccaatctt ctcctcactc acaaataaaa aaccaaaaca tgaagttagtatacctcaac 6060 taattaggtt cctgatagtg aaacccatcc atccagattt aagttttaaaattagcacgg 6120 gtgcttgtat ttacgagtag ttattctacc aaccaagttg tagacttgatacagtcgagt 6180 tctagacttc gcatatcaag atatgtcgga ttagtcttcg aaagtacttatagaggtagg 6240 gtgtcttaga catatatccg tctagtcttt gaaaatgttt atagggatggagtgcgtata 6300 tgtgtacatt cataagagtg atagtacgcg tgtttatatc agctgcgactgtactatatt 6360 ttaaaaaaac taatgcatcc tcatctaacg aactaaacct accagagaggggaggaggaa 6420 tggtaggaac actaaaaacc gaaaactttt agtggaatta aaagttaccttcgaagctga 6480 tggagatcgg tctgaccggc ttgattgccc tagtttgacc gcgccttgttgccaatctaa 6540 ccggtgtaga tcgaggtctg accgcccttc gcgccatcga cgcttgttgccgccatataa 6600 ctgccggtgt gaccgagcag ttgcctctgg tctgatcgct ggtgatctgtcggtgtgatc 6660 gctgcatctg agcaaaacac aaattaaaag atgtcttgaa agttcgtagattgaatttta 6720 ttgcttatat tcgtgttaca aagtacaaca acaacacttc tctcacagaattcgactaaa 6780 ctcgaaacct taacttttct ctaaattaaa ctctcccaaa atcgatataaggatatctca 6840 actccctctc tatttatacc caagacaagc agcctaaagt cacgaatctaatttgtacaa 6900 gaagttctaa ttcactagaa aaccttcccg tacaagaaac aaacttatctcatcataatt 6960 cgaatttcaa tcctcccaaa tttagactcc ttccaagttt gactccgcttttcatacgca 7020 cacaatctcc cattgaaacg gtcaattagg cctaaggggg ggggggtgaatgggctaatt 7080 taaaaactta agtaaatgca gaagcatggt ttttcagaaa ttctgaaaatgctttacgaa 7140 aattctgaaa atcacagaat atgcacaagt gaaagtaaat tctagatctagctacataca 7200 acaatgaaat acaagcacaa acaacaacta gacttataac ggtacaaacaagcaaagcta 7260 gaggagggag agaggtatat caccgaggtt gttgcaataa gagttgttcctgaagtttga 7320 atccttaagg ggattatact ttccgttgag gagctcacaa cgagccgggtctttgctaac 7380 ccttttctca aggggttgcc caaagcactc ctccttccac tagtggtatctaggggtgaa 7440 aacgatcgga atcgttagca tctcttcgga aacgatgctc gatcggccggttagtcctca 7500 catatactca aacgaaaatg ttagtcctct ctaatcatat tgttattaatcactaaaatc 7560 attaggggcc tagatgcact ttcacccatt ttatgcgaca aagaatcatcataaaacaat 7620 gtgcattgtt ctttagacta agcatcccgt acgatattct gattgtccagacatcatctt 7680 ctcccaagtt gactctcgat ccatcaccga caacgctctc ctgaggcatcaagacacact 7740 tacacaggaa tcaaaacaag aaaccatatc cgagcacaag ttttttcctaacttgactcg 7800 acattagcaa acaacaatat tacacacata tagaaataat ctagaagtcataattatgag 7860 ataaacatga gtatccaaat aaacaactca aaaccgaccc taatcaggatccagccagtc 7920 tgcccgcaca taccacaccg gtctgaccgg ccttaccggt acagtctgatcgacttcaca 7980 caaaaaaaac acttcaccaa taatcaccaa atactaaatt aattatctatcatgccaatt 8040 attcatcaca aattaataat aaaagcacac tttgatttca cacaattagttaactgacag 8100 tcatatcgat atatagccaa atatcggcaa taattgtaca agctccctacatctattctc 8160 ttaggttctc ctagatcgat cacactggtc tactccttcc atcccataaaatacaaacgt 8220 tatcccaaat gtgagacacc ataatactgc aaatctggag aggcgtgattaataatttct 8280 catcagccat aggggtaata ataatccaag gctgtgttta gttccacgtaaaaattgaaa 8340 gtttaaagaa attggaacga tgtgacgaaa aagttggaag tttgtttgtgtaaaaaagtt 8400 caatgtgacg aaaaagttaa atgtttgaag aaaaaagttt gactaaacagggccaagtca 8460 tggtgatggt aagtcagccc ctaagctttc actttgacct attctactctcacactgacg 8520 ttgacattgt atcacagggc accacccatt ccctctcctt aatttttgtttgactatagt 8580 aaccatatca atcaaatagg tgtgaaagct aacctagatt atttctgatcacactgatca 8640 cattctcact agctattcta cactttctac cttcacatag gacgacaagtgtaccaacct 8700 actgataaat tttcaagcct catttttctt ataagttatg gccaaaattaaaattttaaa 8760 aattagtttt aaattatttt gatgttgttt catcaccttt ttctagctttagctaaaatg 8820 atataatgat agagaaaaat aaaagtcata acctaaaaat aatttttagctcgtttttta 8880 tagcttatta gcagcagact attcaactcg accctccgtg ttttctcatataagcccata 8940 ccatggctac gtttgaaact acaactctac aagttctcat attttattttcttatttttc 9000 acaagtacgt ttttgaaact gtaaaacgac gtgtgtttct aaaattttctatagaaaaat 9060 tactttaaaa catcatataa atttatttta atttttttaa ctaatactacctctgtccta 9120 gaaagactgc agttttgcac tattcatatc caacgtttga ccgtccgttttatttgaaag 9180 ttttttataa ttagtatttt tattgatatt aaatgataaa ttatgaataatattttatgt 9240 gtgactattt tttaattttt ttaataaatt ttcaaacaag acggacgtggttagtatttt 9300 tattgttatt agatgataaa ttatgaataa tactttatgt gtgactatttttttttaaat 9360 ttgttttata aattttcaaa taagacagac agacaaacgt tggacaccaaaattcataac 9420 tacagtcaaa atgggacgga ggtagtaatt aattaacctt ttttaagaagaaatgagaag 9480 ttcgaacaca acccgcacac ccaaaccaag ggggtgttta gattgaggggtgtaaagttt 9540 tggcgtgtca catcggatat tatatatgat gttgtatagg gtgttcggacactaataaaa 9600 aaatcgactt attaagccta attaatccgt cattagcata tgtttactgtagcacaacat 9660 tatcaaatca tgtagcaatt aggcttaaaa gattcgtctc gcaaattagtaacaatctat 9720 ataattagct attttttagc ctatatttaa tatattatac agatgtccaaacgttcgaca 9780 cgacgggatc taaaattttc gggtgggctc atcgaaagca aacaaaacgaacccccgcac 9840 acgaacggtc acacgctgat cttttctcca aacccaaaaa agaaaagaaaatcagaaaaa 9900 gtcccgccac tacctctact actgccggcc ctacccccac agattccaggcgccagctaa 9960 gcacacaccg acgtgcaccc ctcccctccc gcatgcgttt ccactctgtctccgctccac 10020 acccgagccc ggtcaaaccc aaccccgccg cgccgtcact ccgcacccgactcactggcc 10080 cccaccaccc accaccgccg ctgccccgtg ggccccgccc ccacccgccgacgcgtgggc 10140 ccccacacgc cgccggccgc ggcgggtcac atgcgcgtcg tccaaactccgaggcgcgca 10200 cgaaaaagag agagacaaaa aaaaagaaaa gagaaattct atttaggcgctctaggtata 10260 aaaatctact cctactcatc aactactttt ccacttcgat tcctctccccttccccctcc 10320 tcttcctccc tctcctctcc tctcctctcc gccgctgccg ctgctgctgcgtgctcctct 10380 catccccgtc tcttccccct ccgcgcgcgc cgcccactcg ctgggaggaggaggaagagg 10440 agaccttccc cggaattcgt gctcgccgga tcgggctcgc cgcaatccatgtcggtgagt 10500 ggtgctgttg atgtgtttct tttttctact gattttagag ttggtgattggttgcgctgt 10560 ctagatcgag ctgacccgcc tgcgtgttgg attgtattga ttagaggaagggaggctgat 10620 tagcacgagg tgtggggaaa aattagttgt aaaaaaaatt ggaggggttaatagatgggc 10680 gtttgttaag tgacgtaagg cgaaagtgat gttatgctgt tctgggttagtaaggttctt 10740 ggcaatcagt tttggtcaaa tcttactggt tcataggtgt tttggtcgaatttcagcttc 10800 agatgaagtt tgtccctcgt ttccaggatt gttggcggtc agttcttcatcgttaggcat 10860 ttaattggtt gaacaggaaa ttggggtcat ggtagaatgc gaagtttctgaaaacataag 10920 tagagaacaa aagtaggaga attttgtcaa ttaggtacga tggaaggtcaccgtcagctt 10980 gttcaactac tgttaggttc aaactttgat cgtgcttgag atttctttgctaaaataagg 11040 ggatatgtgg caaagggaga ataaaccatg atatgatttt tgttactgactatagtggct 11100 gtaaatgaga acttgtggaa tcacagaaag ttccaaaaaa caatgcagaaattgtatcct 11160 tttttgtgct ttgcctagtc cattttctga cttctgagat gaggtccccgcatcacatag 11220 aactgcaatt gcaattcaat gcattcatgc aacaaacaga cagactactgtcctataaca 11280 tgagttatga cttacgagta gccttgcata ttgtacatgc ataagccaaagctgcttgtt 11340 tattgtaaac cagatactct gtaccataaa aaccataatc cattaagttttcttgtttac 11400 tttgttgaac attatactag atacctatgg tgttgactgt tacataacattctgttcacc 11460 atttggcaat tttcccccct tagtgacatt cattcttttg agatttcagtcatgccttcc 11520 acatgacaat atgttgcatt caactatgat gtttggaatt gggaaacggtactcatactt 11580 aatgtttctt ttactaattc tgattgaaag catgcaacac accttttaatccacatgcac 11640 atacatcaat caatgctttg gagagagtgc atcatggatc aatcatattgatggtttatt 11700 agtaaacatt ttacagcaat tatttgtgca catggccctc atacaatttttgttgggcat 11760 tcaacccttg ttttgaggtg ttaaataaat tattttttat tatgttgtcaaattgcctga 11820 cctttgtatt ggtggttggg catctagctg tgcactgtca ataactcctggcggtgctcc 11880 tctttttcct ggctgttgaa cttccaatct gaccattcat tatgttcatatcttgtacac 11940 atgtggtgag atgtcaattg ctgtccattg tatgttatta tattttacttagcattgttc 12000 aactgtaata gacagtgata tatcatacat atcaaatgaa accaacacagacaaagcata 12060 tgctgtgttg tcatcttctt atttcttgta ctgttctgtt ctgtcttgaaggttagctgt 12120 gttagtcatg gcccactatg gattctcaaa aggcatcctt tatgttcattagtttcatct 12180 ttttctattt tccaatttta ttagatggct atacacgact ttgttcacacataatctcta 12240 attatcaatc gtctggtcat gcttggcagg cctcaatcaa atgccagcccacccatggga 12300 aatgggccag gattgaggcc tgccgaacaa ggtgtagagg tagatacaccctttaatgtg 12360 gatcatggtt cctttctagg aggaagcagg tcgacccttg cccaagtttggtgaatggga 12420 tgtcaacgac ccagcttctg ctgatggatt cacagtgata ttcaacaaagccagagatga 12480 gaaaaagggt gggaatgggc aagatactga ttcaccctgc aaagagactaggactgagag 12540 ggtggaatca tatgccccca aaacaaactc ggtatgtcta ttacattactcacgctgatt 12600 ataatatcag aatttcaatt tctccctgat taatataaag ttattccttttctcttgaac 12660 tagataacta gctagatgtc attaatctca ttgttttcat gtgcagaagaaatggttttg 12720 ctgtgtgaca tccagtccta cacaatcttg atgaaaacga gttccatgggttgcaaaatt 12780 actatccttt aattttgcta tatacatact atccataaga ccttgtagagatgcccagac 12840 tctgctgtgg tgcttgattg ggcatctctt aaaactctga ggtgtgtgtatgtatgtgtg 12900 agggttatca gatgcacatt cggataaatg aacttctgat tgtaattctagccttccgtc 12960 ctgtgacatt ctgtgatgta gtcgttattc agtgatataa tgaatctcacctattgatat 13020 aactgcacac tgtttctctt gctttccgag gaaacaaaca ctgttttaccaaagttggct 13080 gtgctccaaa tgaggattaa attccgtatg attctcaaat tgcaattgcaattttgattt 13140 tctcacccat taatatgtga atttggaatt ttctttcacg tgtcaaattcgcatattgca 13200 atttttctaa aggatacaag cacagtatag ctcaatctct tgcaggctgagttggtagca 13260 atgcaattgg cgatgggagt tttcttaata ttgtaggcat cacatttctaacacacaatg 13320 caacggtgta tcgactaggt gataaggtag caaaggatgt gagatcgcaacagctcaatc 13380 cttcagttta tgattgtcaa aatatcttga gtcttgctca ccctagaaaccttcgttatg 13440 cttgagattt gaaaaagagc aacgaattgt attacaagtc atgtgctctgtttctgaaat 13500 ggaaataata ctgacgagtt ctaaagaaaa aaagaaccat gcattcatgcctgtacaaca 13560 gtagttggta ctgccatgtg ccatctgctt cagatgagtt tcagaccctgcagcacggac 13620 agcaaatcta ctccctccgt ctcttaaaaa aacatcctgc ctctcctaatacaacgaatc 13680 taacatcaca tcttttctag gctacttttt ttttttttgg cgaagagagtacgtgtcaga 13740 gtggagattg ctatattggt ttcagaattt catcagtgtg cccatgcccaatactagtat 13800 tattcgattt tcagtgagga acaaatgcca gcatactaag ggcctgtttattttgatgcc 13860 attttcaacc ttattaaatt ttgataaagt tgctaaaaaa gtggctatatttagtttgct 13920 gctaaatttt ggtaactata taagaaatcc tgccaaaatt ttggcaagtctatgccctaa 13980 tatttcgtgt tgccaacaca caatttcact tttttttaca cgatttagttcaggctgtgt 14040 ttagatctaa agtttggatc caaacttcag tcctttttca tcacatcaacatatcataca 14100 cacacaactt ttcagtcaca tcatcttcaa ttttaaccaa tatccaaactttgcgctgaa 14160 ctaaacacag cctcagtttc tctgaaattt gcttgggatt cattcgttcgagaagaggcg 14220 tgagatccga aactttggga gagggaaagg gcagttcaat tgttcaaaatacctgatact 14280 atttgctcac tagtagctcg cgtggagctt gagagagaga gaggggagggagagtagtgt 14340 gagggtggga gcaatcgatg gcggagacgg tggtgagcat ggcgatgtcggtgctgggca 14400 gcgccgtcgg gaaggccgcc tccgccgccg ccgacgaggc caccctcctgctcggcgtcc 14460 agaaggagat ctggtaagtg cgtgcacgct tctgtaactc cataccacatcggtgactct 14520 ccatggcccc tccgcggcgg cgtgtcggcg acctcggctt ccatggccgccgccgccgcc 14580 gcctcacagc cctgcttcgc cgcaatttct tttttggtct tttggtgattccaacggatt 14640 caggatcaaa aatgaaacaa tccggagagc ctattcacgc ttcgcatttcactaccccaa 14700 acttgtggat tgtgaccatg caaaccaaaa ttccttcagt ttcatactttcatatatgct 14760 caaatttccg ttgttttcgt agaaacaagg ttatttatcc cacttacatcttgcaaaaac 14820 caaagtacag ttactcttat cgattttcat ctatacacat gctagacgaatgcattgtgg 14880 tccaattcaa atttctgata tattagctac taaccaagat gctcccatgccatgagcgca 14940 agttccattg acaaggaaaa acactttttt atgatttgag caagttccacgctggcgtcc 15000 tcgttggcac accacgtaag ctaaagtggc aaaaaaaaga aaagaatttttctctctaaa 15060 gtgagtcagt gacaaaacct taagtgcaaa attaaaatgg gtatttggataatcgccaaa 15120 tctaaaagtg gcaaatagtt aaattcccct tctatatttc tctcttcagtggcaaaaact 15180 taagtgcaaa aataaaatgg gtatttgcat aatcgccaaa tctaatagtggcaaatagtt 15240 aaattcccct tttatattag tccttgagga gctacggcaa tgtttgctagcaccggacaa 15300 gatcatccaa taaaaactta gagattatac ataacagtgt cactgttactagcaaaattt 15360 tccttcctat tgcatggtta gtacattttt ttttgtgaga aatatatggtcaatgccatt 15420 tgcaatatac ctccaggctc tagtcaatat aagactacat acatatgcgatctacatttt 15480 gtactaggtc tctaccaaaa gaaggggttt gtacccttct gttctcttaaattaatgaga 15540 cgatttgtag gatctttctt tttcaaggga ttgtgggagt tctgactctcatggctaacg 15600 aggcgctctg aatactagta gtttatttcc atccatctat ttatttattttaataaattt 15660 tattgctttt gaatttatat ttgtaggtac atcaaggacg agctgaaaactattcaggca 15720 ttcttaagag ctgctgaagt aacaaagaag aaagatgact tgctaaaggtatgggcagag 15780 caagtacgag atctgtcata taacattgaa gattgcctag acgaattcaaggttcatgtt 15840 gagagccaaa gcttggcaaa gcaactaatg aagcttggtg aacgccatcgaattgctgta 15900 cagattctca acttaaaatc aagaattgaa gaagtgagca acaggaatacacgctacagc 15960 ttaatcaagc tcatttcctc tataaccaca gaggatgaga gggattcctacctagaagat 16020 gctcgcaatc gatcaggtag caacactgac gagtcagaac ttgtgggctttgccaagact 16080 aaagatgagt tgcttaaact gatagatgtc aatactaatg acggtccagctaaagtgata 16140 tgtgtggttg gaatgggtgg attaggcaag actacccttg caaggaaggcatatgaaaac 16200 aaggaacaca tgaagaactt ctcgtgttgt gcttggatca ctgtgtctcagtcatttgac 16260 aggaaagaaa ttctgaaaca aatgatcagg caacttctgg gtgctgattcattagacaaa 16320 ctcttgaaag aatttagtga gaagttgctc gtgcaagtcc agcatctcgctgatcacttg 16380 gttgaagggc taaaggagaa aaggtacttt gttgtccttg atgacctatggaccatagat 16440 gcatggaatt ggattcatga tactgctttt ccgaagatta acaacagaggtagtcgcata 16500 ataataacaa cgcgagatgc tggcttagct ggaaggtgta cctctgaatcacttatttac 16560 caccttgaac cgttacatat agatgatgct atacacttgc tactagcaaagacaaacata 16620 agacttgaag acatggaaaa tgatgaggac ttgggcagca tagttacaaaattggtaaaa 16680 aggtgtggtt atttaccgct ggctatactc acaataggag gcattcttgctactaagaag 16740 ataatggaat ggggaaaatt ttacagagaa cttccttcag agcttgagagcaatccaagc 16800 ctagaagcca tgaggaggat ggtgacccta agctacaatc acttaccatctcatcttaaa 16860 ccatgctttc tttacctaag tattttccct gaagattttg aaattcaaagagggcgcctg 16920 gtagatagat ggatagcaga gggttttgtc agagccacag atggggtgaacattgaggat 16980 gttggaaata gtcactttaa tgagcttatc aacagaagtc tgattcagccctcaaaagtt 17040 agtacagatg gagttgttaa gagatgtcga atccatgata tcatgcgtgatatcatagtt 17100 tcaatttcta gagaggaaaa ttttgtgctg ttgactaggg agaagatcactgttgtagcg 17160 gaggagagca tccgccatct agcatttcat gggagcaaat gctcaaagatatgcttggag 17220 tggaaccatc tgcgctcagt aactttgttt ggtgacagac ctgcggggcgaacacctgca 17280 ctttgttcac cacaatttag gatgctgaga gtgttggatc tggaagatgcaaaattcaaa 17340 ttcacacaaa atgatatcag gaatataggg ttgttgcgcc acatgaaatatttgaatttt 17400 gcaagagcct caactattta tacacttcca aggtccatag gaaaattgcagtgcttgcaa 17460 attttgaaca tgagggaggc aaatatctca gcactaacaa ctgaggtgactaaactccag 17520 aatctccgta gcctccgatg cagcaggagg tcaggttctg gttactttagcataatagat 17580 aatcccaagg aatgcttgat gatcaccatg tgcttaccga tggttttctcaacttcaata 17640 aatttcagtg accgtgtgaa gttaattcct gagatatgca tgtcatgttctacccgttgg 17700 tctgatacaa agggtgtgag ggtgccaaga ggaattgaca acctaaaagagttacagatt 17760 ctagaagtcg tggacatcaa cagaactagt aggaaggcga ttgaagagctgggggagcta 17820 attcagttaa gaaaattaag cgtgacaaca aaaggcgcca caaataagaagtatcagata 17880 ttttgtgcag cgattgagaa gctctcttct ctgcaatctc tccgtgtggatgctgaggga 17940 ttctcagata ctggaacact tgagtggctc aattcgattg catgtcctcctccattcttg 18000 aagacactca agttgaatgg atctcttgca gatacaccaa actggtttgggaaccttaag 18060 cagctggtga agatgtgctt atccagatgt gtgctacaag atggtaaaactatggagata 18120 cttggggcac tgcccaacct tatggttctt cgtctttatc gcaacgcatatgctgacgag 18180 aaaatggcat tcagaagggg aacattccca aatctcaggt gtcttgatatttacttgctg 18240 aagcaactta gagagataag atttgaggag ggcacctcgc caacaatggaaagtatagaa 18300 atttatggtt gcaggttgga atcagggatt attggtatca agcaccttccaagacttaag 18360 attatttcgc ttgaatatga tggtaaagtc gcgaagcttg atgtgctgcaagaggaagtg 18420 aatacacacc ccaatcatac tgaactgcaa atggcagagg atcgaagtcatcatgaccta 18480 ggaggtatat tatataagga tcaaatatag catatgaaca atgaaaatcgcgtactgagc 18540 ttgccattga cagatcatgg tcctgaccag gccgttgcat ctgatggcgatgatgcccat 18600 gacaatccag tgctgcgcta tcaaacatca tgttagacat caagagttcagtcttgcact 18660 tgacttgaga acctacccct gttatgtaat tcttacctga catattcatctgaaattcat 18720 ctgttctgcc gtcttgtttt ttctccctct ctccagattg tgaccatacattttctttta 18780 ctacatcttt gcaccccaaa gcgaagatac gtgcgtgctg atgtgatgcttctagcttgt 18840 aattggtgat ttctcctctc ttctccagct cagtttgccg cctgctgatgctatctctaa 18900 tatcgaattc tatgcggatt cccgtgcaaa tagcacctcc acgtggcgcggccagattca 18960 cggattcgct cccacgcaca gccgtccgat tgcatttcaa cggtgagagatcgttgtggt 19020 ggagcctgtt gtaatatacg agaatcgagg ggttttctta caaaagcactcaatcccccc 19080 atcatccgtc tcctctccag cgtcaccgcc gcctgattca gcacatccgccgcgtcaccg 19140 ccgctgacct tcgccctgac ctcctccccg ctgcaccgaa gcctctctatcccctgcgcc 19200 tccacatgtc gcggctacgc gggcatctcg ttggatctat catcggcggcgaggaggcgt 19260 ggcgatggtg ccgaggttca tcggccggtg gaggaacatc gtcgacttgaggaccgtgcc 19320 caggttccgc atggtcccca actactgcca ccccgtggga gccgccgaggggggcaacaa 19380 ctatggcttt cagatgatag tttgatcatg cacacaatgc gttcgacgaaatgtccatgg 19440 gcattcatgg gtgtggctgg cgccctgatg atgctcaatc ctaccttcctcgttggatta 19500 ctttggtgtt ggtcgaagca ttggccgacc aaggaatgtg ctgccaggagttacaatggt 19560 ggactcacgt cttagagggc agttcccttc tgactcatat ggtgcgatgacataccgtgg 19620 cgtgccgtga aaacctgaga ttggcaaatg gcttactaag tgtgatgccggcgtgtcttt 19680 gtcactgacc aattcggatt gcttgagctt tccctggatt aatttggttgttcttctgaa 19740 acggtttata ggtgcaaagg ggtgcgagca agactacaat ggacaaggtgtgtgcaacta 19800 tgagctggga taatggagat gctttcatgg atatgttggt aagtactactttgtcagctc 19860 aattaatgtg ttaatcttgt tcacacttga ttaatgaaat tgcgatttcacatggaattt 19920 aacaaggttg ggttttaatc acactccatt ttttagcacc atccaaaaaatctttaagat 19980 tgttttagta atttatctac tccgtacata atatcaagat aaataggcatagtttatgaa 20040 cctttgaatt ctgcacctgt tccatactaa gaagcaaata agaaagaaacaaataaatac 20100 tccagttggt tagtgtgttt gctcatatat gccattctta agaaaaacatcagtttatat 20160 atttatttat gtcctattta gtttgctgtc tatcaactct tcaaaaattatggataaact 20220 gtgctacttc aattctttgc ttttgatcaa atgacagccc ttgctctttccaacaactct 20280 attgacaaag atgttaaaca tattttgccc aacactacaa cttttgctccttagttctgg 20340 aaatgatatt gtatactcta taaaccaagc taattcttca tttgacttatggatatacca 20400 tcatagaaat agtataagga gacagttgca tattattact acacttattttttgaatata 20460 tatttcatca ttgtacatat atatatttag acagctcaat tatatatctttctgacaatt 20520 gctaacacaa tatttattat aacaaagttt gcttggaaat aacataataaaggatataat 20580 gaggcaataa gaactgtgcg tggagggccg gaagatgatg tctgtcatggccggcaagat 20640 ggacgagctc tggatagggc gtatggagtg gtgcgtgctg gaatgtaacaagaacgccat 20700 caacttctac gagggaatgg gcgagccacg caggtgtgct cccgtaatggcacatataca 20760 gactcactgg cgctgcacac tggaaagtac aatgggacta agagaagggcaatggggcga 20820 aatagatcgg tggttgattc ttagttgtcc agtctcaagt aataaagaaaccggagatgt 20880 gtgatttagg gtattggata agttcgtgat gtcttagctt gatgttctttcaaatgtagt 20940 acacaatctt tgcagaaggt taagggaaaa taaacaaatt cttaaattataggaaaaggc 21000 aaaaaaagat aattcaggag agtcacaagc cccattgcat aacaagcccaaaactgaatg 21060 ttggagacac tctagatcaa aacgttagca agcggaaatg gtatactgttatacctaatt 21120 ttataaaatt attttatctc taaccttttt acatttcctt tttaccttgttgcaattact 21180 gacaatcacc caattactgg actttgcaga tgacatcgaa aaaatcctaggaaacactaa 21240 caagcatcca ttcaggacaa tacatttttt ttcatgctag caatggtgttgtttttttct 21300 tgcacaattt tcatctcaag tcccaacgac tatttattta attcgttattaaggaaaaaa 21360 ttgatggatg ctttatctag acctctatgt acactaatat caaacaaatctaagctatag 21420 ataaattaga attagattgg cctaatatca attgggcgtg gcaaagacgcgccattattt 21480 ctagtgctta tactattata atgtccaagt tttgagtcga tctgatgtcctctgaagtga 21540 tcatgtcgag gtaaagtggt cagtccatgc acagtattct tctgtgatgtcttgcagttt 21600 ctgcttccaa ctgatccgtg acgtgatgtc caaatatttg tgttaatatatgtcagaatg 21660 tcacatatac caaattccta gtcgcagtta aacggcaaga aattatgcagtttggtgcac 21720 tattcttctg aatctagcgt acacctcatt tcaaggagtg acacggcttgccaggtgacg 21780 gatgacttgg tggcggacgc aggatttttg agatgggtat tcgagacatacataggaaaa 21840 agaatatatg tgacatcgca agctatatat tacattggga aaaaaattataaatctcatt 21900 cagcagaggt ttctttgaag gacggcagca acggaggaat ctgtactcatcaggttacaa 21960 gaaggaaata atccagtgtt aacttcagag agaagaaaag gtaggtaaagcacactcgtt 22020 ctttgtgcta attatgctgg tttattcata tttatttctc aaagttaaaggtacttcatc 22080 ttgagcgacg caagtacaac aaagcaaggt gttttgtatt ggaaagtacagatgtacagc 22140 tagcaatgca aaagagaagg cagaggaaac tcagaactca caagggtacgtaggtccact 22200 ataaatccct cgaccagtcg acctgcctct gctgttgttg acaggagctaccggcttact 22260 ctgggtttat tcttacaaat tttccagatt agagaaggtc caagagagtgtgatcgggta 22320 agtgaaacct gatttttttt tttcaaaagt ttctatttgc atctcttgaacatttttaat 22380 cgtatgtctg tcagtttggt gtatatccac attgtttata atctgttgaaatggcagact 22440 aagtactggc ttcgatacac tctaatcaag caagtttaaa ccaaaaaaaatacaattaag 22500 agaggatgct ggtgatggcg ttgttttgag tttggagagg gtcattatggaggatgccat 22560 caggaagccc aacaccgacg gtgtgagtaa cctttgggtt tagtggaaatccggcaaaat 22620 cttgcaaggt gattagtgag tgaagccatt ttcagttggt gcttttctttgattatctga 22680 tttttaacag attttctttt gttcagcata agatagcaga tgacatgaactgttggtatg 22740 ggtcatgggg gaaacagtgc tcagcattgc gtcgtccaac tgccacaaatgacatgagct 22800 tgctgctggg agtgcaggag acatggtaag catctagctc ccacataataagattttcct 22860 tttcataact gtgcctagat ttattggaac catgcacagt tgttatgggtcattggagat 22920 tgggagaact gttattgaga ggcagaaggt gtggaggttc ttgttgagtatataagcacg 22980 tgtaggcacg tttaatttat tcgataaata aatcacgaca ttgaacgtgtatactagcat 23040 tatcgcatcg tactatttac acatgtaaac taggcatgaa atcatgaacatatcgaatat 23100 gcaaaacatg gtgaatgcgt actgagggtt ctagctgcac agtacgaaaacacatggtac 23160 tggacattga cgacgacgcg tagcacttga gcgaaaagga agatgagccgtgatgaacga 23220 atagcaagca gtcatgcaaa gcgcttccca aaaaccttat tgtcaccttctcctggtgca 23280 ggacgttgaa tgcgaaggtt ccggagagta gctctcccga tcaccgatgcacgccgacga 23340 tcggggcaga gtagactacg agcgacgatg tagtatagag gagacaaatcttagattgat 23400 ttcacatatg ttgcgaggag gccgtagctt gatttatgta gagatatcaagacgcgtgat 23460 taggacgcct gcatgatctc cataccatgt aaccgaaccg gataagtcctgtgtaactta 23520 tccagactcc acgccgatcc acgcaccgga ttaatcggtg tgtttccaaaaactaggttt 23580 ccaaaaacga aaccgaattt ctgcagcaac acaaaactgc aaaaggaagccgcatctacg 23640 caagggtgag gaaccaattt tgcagacatt cacgctcatg ccatgtacacaagccgtttt 23700 tataacagga atcattcgtg ttaattattg tgccattgcc cttgatcaagtagtctggta 23760 ctcacgatta cattgctgga atatccagtg tacatacata tagcatcttgcgtagattat 23820 tacatcatag agcccgtagg catataatag aacatacgac caaacggtccagaatacatt 23880 caaagacagt gattagataa ggcaccggag cttaggtagt gacaacgatcgtaactaaga 23940 cctattgggc gccatcatct tcagcaacat tcagatagaa cttataagggtcctcagtga 24000 cttcattatc cacctctgat tgattgtaaa tagtaagggt gagtaccaactgtactcaac 24060 aagccaacac aacaataatg cacatgataa gggaattcag atggggtggctaagggatat 24120 tttgcataaa gccaattttg taattctttt ctcaagccta ggacctaccatagacttatc 24180 aagttttaat ccattattta ttaactagtg acgtttctgt cctaccatccattgtgatcc 24240 caaggatagc ttcccgccac tgtattgtca tggttttcta gggtctaccccttctcgcca 24300 cttgagaagt tgatccacta gtacaaaatc atcacgcatc atcccacccatcaatttaag 24360 aatttgagtc tagccaagtg taatacatgt cccggtgctc attatccgcaagcatggcta 24420 tttgaataga ttggtttact cacactgcag tggatgtacg ctttaacccgcacctcttga 24480 cttgcccaac aaatgagccg aagtcccaac gaatgaggtg catcacgacaaagcctttta 24540 ataacctcac tttggtagta cccgttccat gaacttaagc cctcatgcactttaggcgta 24600 cacctgtttc tagcagagag aagagttctg gtgttcccga aagggataaactcacacatg 24660 cattgaatta tatttaagtc aagttttgtt taggttaccc actgatggttatccatctgt 24720 aggctccccg cctattcaca cacctacatg aatccaccac atcttgggaattggctctgc 24780 tcggctattt actctgctag atccatactc aatgcgtgaa gtacggttgtagtggggttg 24840 tttcaaaatc agtctttaga acaacaatat taatctgaga aggaaattcatatggattcg 24900 caacttttta ccaaatgtca taactgttaa acataatata agattgcatctttttgcacc 24960 aagaaggaaa atcaactcta tcataaacat gtggcttggt cgaaaatctttcagccatgg 25020 caaaaactca agatcagtaa agatcaaaaa gagcaaacgt gactctgataccacttgtag 25080 gatcgaacac aaacaaccaa acctaccagg gggtgaatgg tagacatcggtcacactgct 25140 agattgccgt cggtttgatc gtggtaccct cgctggtctg actgccgccttgctactggt 25200 tggaccgcct ccctgtgccg gttagattgc tcgtcagatg ctgttctgaccgtcgggatc 25260 tagaaaaaca cgtctcgacg aaactcttga aattagatca actttattgcatcaatatgt 25320 gtttacaaag tgtatctaaa gcacccctct cccacctcac aaactagaattgaagcctcg 25380 aagcaaaccg taacttttct ttcccaaaat gtagatatac ttctagtctcaacccctctt 25440 ttatttatac ccaaaacatg gctgccccta gccacaaatc caactagaactaggactcct 25500 aatcacatag gaaacctacc aaaccaggga aacaacattc ccgtacaacctcacgctcca 25560 atccttccaa acttggactc caatcaaaat ttaactcttt ttttcatacgcacacaacac 25620 cacgtgtatg ccatatgaaa accttcacct ccacatgcac tgttctctagcctaagcatc 25680 ccgtatgata tccttgagcg tctagacctg aacttgtccc aaagcctagccccgatccat 25740 caccggctat actcccatgg ccttaatcaa cacctgcaca tgaacaacaaagaaagttca 25800 tatccgagta caagttcaca ccaacttaac ttacatcagc acacaatagtatcacatacg 25860 aatagaaacc atctagaagt cataaacatg aaacaatcta gaaattccacgaaaacaacc 25920 cgaaactgac tcaaaactgg agtatgccga tctgacagtt gggctacccggtcaaatcct 25980 ccggacacct ccggtctgac cggcgcccac tgcccggtct gaccggccacctagtgccaa 26040 caacatccga aaacactatt caccgtactt tcaacaaact cttacctttgataatcgttc 26100 atcaccacga ttaaatcatt acatgatttt acaccttggt gacatgaaaatgattcatcc 26160 aaaagtcacc atggatggga ggaggaagac cacatcttgg gctatattgggctctatttt 26220 agtaggaatg gattctaatc tctcgaaggg gtgccacatt attattgcatgctatctaaa 26280 tagttacgaa ccaatttgaa aagacaattg aaaatataga ttaatatgtaatatatcact 26340 tcacagacat gcaagtttaa attcaacctc tgaacgttct aacaaaaagaacaaattaaa 26400 ctctattcac agttgtcttt ctttctttcg ctataatttg caaagattgaatttgaactt 26460 acatgtttgt gatgtgatat atcacatatt aatctatctt gtcaattgtttctaattttt 26520 gcataactat ttagatgaca tggaaaaaac gaggggatat ttccttgagagattaaaata 26580 gtttccctac tttagctgat tgttttcaag ggccttagtc agtttgctggcagcagagca 26640 ttgttaccag agcatatgtc tgatccaatt tgcatgatta tagtaaactgtttttttaat 26700 tttagcaata tctgcataga taacagccaa agtaaacaaa aaatgggcttcagaatgctt 26760 gtactttagt attaatttag aattatattt agattggaac attgctatgtgcattttctt 26820 gtgtagtttt gtgacttcca tgatcacacc tctctatttt catcttggatttgccacttg 26880 cccattgtaa tgaggattca ttgttttcat ttcatacgta tagtatgataatggagtttc 26940 tgttccattt tattcaggta aatgaatgag agtaccttat cgcaagaaaacctaaagaac 27000 tccaccatca gtgatcttat atattattga tccaagaatt gatttggaggaaggagatga 27060 ctgagacggt ggtaagcatg gccatgtcgc tggtcgggag tgccatcagagtggcttctt 27120 ctgcttccag ccaagagatg agcatgctga ttggtgtgca aaacgagatttggtacatac 27180 tctggatgca acttcctcat ccttcatcaa gttcttgatt ctcttgtgacaaccactttt 27240 ttccgcaata gcttcaacta gaaatgcgat tgcttagtga ttgacaggtaatatgtgatg 27300 gcaatctttg aactgtgatt gacaggttca taaaagatga gctgaagacaatgcaggcat 27360 tcctgagggc ggctgaagta aggaaagatc aagatgaact cgtgaaggtgtgggctgagc 27420 aagtgagaga tctagcctac gacattgaag attgtcttca ggagtttgctgtccatgttg 27480 ggcatcaaag cctatcaatg cagctgatta agctctgcca ccgacaccgaatcgctgtcc 27540 agattcggaa cctgaaacag agggttgaag aagtgagcaa tagaaacatgcgttacaatc 27600 taatcaagtc tgttccatct agcagtaccg atgactccca aagcaacatggaattgattc 27660 ggtatcagac tgctcattat gttaatgaat cagagcttgt tgggtttgacgttcccaaaa 27720 aggagatcct ggagttggta tcaagcagtg aaaatgctga agcccaaacaatttggattg 27780 tcggagaagg tggtcttggg aagactactc ttgcaaagaa ggtttatgaaagctcagatg 27840 ttagtagcaa gttcccgtgt cgtgcttgga tcactgtgtc acgatcatttaatgtcaagg 27900 atttactgaa gaacatgatc aagcaattac taggggaaga ctcactcaagaaactcttgg 27960 atgaacacaa agaggtgatt gtagaaaagc acaaccttac agaccacctgaacaaagggc 28020 ttagagggag aaggtatttt cttgttcttg atgacttgtg gaccacacaagcatgggatt 28080 gcattaaacc tacttcctgg ggaaataaca tggaaggtag ccgagttgtagtaacaacaa 28140 gaaacaaaaa cctagcaggt ggtagttcca cctcgccagt ctactgccttaaaactctag 28200 agaaagaaga tgctaccaaa ttgttgttaa gaaagacaaa aagaagtctgcatgacatag 28260 aaaaagatca aatgaaagaa atatttgaaa agatactaaa gaaatgtggaggtctaccgc 28320 tagctataat cacaataggt gcagtccttg aagggaaaga tataaaagagtgggaaattt 28380 tgtatgctca acttccatca gaacttgaaa gcaatccaat cgctgaaccaatgaagaagg 28440 tggtaaccct tagttacaat tacttgccat ctcatcttaa gccttgctttctgtaccttt 28500 gcatctttcc tgaggatttt gatatccaaa ggaagcgcct agtacatagatggattgcag 28560 aggggtttat tagagctagg ggtggagtgg gaattgtgga tgtggcacaaaagtattttg 28620 atgagttgat caaccgaagt atgattcaag catctagagt ggatatagagggaaatatta 28680 agagctgccg agtccatgat atcatgcgtg atgtcatgat atcaatatctagggaagaaa 28740 attttgtata cttgatgggt gatgatggaa ctagtgtagt ggaggaaaatattcgccatt 28800 tagtgcacca tgatactagc aagtgttcaa atataggcat ggattggagccatgtacggt 28860 cattaacttt gtttggcaat gagagaccca aagggctatc tccttcattttgttttcccc 28920 aattgaagat gctaagagtg ctggatcttc aagatgtcaa atttgggatgacacaaaaag 28980 atatcggaaa aatagggttg ctgcgtcact tgaaatatgt gaatattggagggcattcaa 29040 gtatttatgc acttcctagg tgtataggaa aattaaaaga cttgtgcactttggacataa 29100 ctgacagtta cattacagaa ctaccaactg agattagtaa attgcagagtctatgtattc 29160 tccgttgtag aggaagacca aactcggggg attttaatct aaatgatcctaaggattgtc 29220 taattgcctt ctcatgtttg cctctgctta tggctgcaac cgattctgatgaacgtaata 29280 aaataattgc tgagctacac gtgggttgtt caagtcaatg gtctcctaatggtggtacat 29340 atggtgtgag ggtacctaga ggaatcaaga atttgaaaag gctgcaggtgctagagacag 29400 tggatatcaa tcgaaccagc agtaagtcag ttgaagagtt gggagagcttatccagctaa 29460 gaaaactaag tgtagtaaca caagggtcca ccaaggaaaa atgcaaaatactctgtacag 29520 ccattcagaa gctaacttcc ctcaaaactc tctatctgaa tgctcatggacccttggata 29580 ctggaacact tgaatggcta cattctattt cccatcttcc ttccctgaggatcatcagat 29640 tgatcggata catgaaagag atgcccaact ggtttaggga gctcagacagctggtgaaga 29700 ttcacttgca gaatagccaa ctagaggagg ataaaacaat ggagatactcggagaactgc 29760 ctaatctcat gctccttttc cttagttggc gagcgtatgc tggggggaagctggtgttca 29820 gggagggaac attctaaaat ctcagggtgc taatcattcg caatcagaagcaactgaaag 29880 aggtgagatt tgaagaggga acctcaccct ggatggaaag gattgatatcagagaatgca 29940 gattgacatc agggattgct ggtatcaaac accttccgag gcttaaggagatttcacttg 30000 aatacagtgc taaagtggtg aggctaggtc agctagaggt tgaaatgggcacacacccca 30060 atcgacccat gttgcgcctg tttggggagc gaagccgtca tgacctggggaacacccaag 30120 taactgtcga tgaacaacaa ctgcaggtca gtccatttat ttatggtgtgcatgcaacta 30180 cttgtttttc tttctcttct ttctcttgct tactattttc ctccttccttgtttgtctct 30240 agccaaacca aacagttagc gaagaatgct tgatggatgc cgatcttgcttctagatgat 30300 gatgatcagc ttcctgttgc aagctctcct ggtgatcgtt cttcttcagttcttgaccta 30360 ccaccaatgc tggctagctt cctgtagtgc tgatctcgga gtcccctgctgaattttgca 30420 cctccggcgc cgagaggcca gttcactgta catcccccct gagacacaatgtcggtattt 30480 gtttgattgg atatgatttg attaagtacg atgatttttg gttctgaaaatttgtatcat 30540 gattggttgg agcgtgtgat taggatgctt atatttgcag cagaggtcgatcgtattttg 30600 ttttgctgtg cacacaaaat gttcgtccca aatgtcttgt cagatattttgtagcttttg 30660 attatcaagt actccgtatt tatgtttcgt aatatgaaac accattgtcaggctctgtgc 30720 acgaattgat gcaatgtaat gatctaagtt gcggttccat gtgttcatgctcttcggatc 30780 aatttagtgt actgtttcta tctaacgtac cttctttacc atattttcttaaaaaggaac 30840 taatatatct taccaatcgt ttctgatgta atatttctcc atccattgtcatactgttgt 30900 tgtagacatc aaaatagtac ttttttttgt ctctcaaaaa tttatctgtaccgacttgta 30960 tctaatagat cattttgttt ctgtgcaatt tgaaccagcc cccttttgccaaaccattat 31020 ggtgctggtt tgcgaatgtt cagttggaga gtagcttgac caaatgggcattcagcttaa 31080 ttgcaatgga ttttagtgta aaatcacatg atgattaatc tttgtgatgaacaaattgtc 31140 aaaggtgagt gttttggaaa atacgatgaa tagtgttttc tgttgtgaacaatgcaggtc 31200 gacactggat tgccggtcag accaggtggt gcacggcagt caaaccggaggtatccgggt 31260 ggtctgaccg ggtagcccga tagtcagacc ggtagactcg gggggtggggggggggtcgt 31320 ttcgagtttt ttcatggaat ttatagatta attgcttcat gtttatgagttgtagatggt 31380 ttctattcat atgtgacatt gtgtgctgat gtgaatcaag ttggtgtgaacttgtattcg 31440 gataagaagt ttctttgttg ttaatgtgca gattttgctc gaggccatgggagtatatcc 31500 agtgatggac cgggactagg cttgggagaa gttaaagtcc agacagtcaagaatatcacg 31560 cgggatgctc atgctagaga acaaatgcat gtgaaggtga acgttttcatatgcatacac 31620 gtggtgttgt gtgcgtatgg aaagatgagt caaatttgga ttagaatccaagtttgaaaa 31680 ggttggagtg tggagttgta tggggatgtt atttccctga tttggtagctttcttatgtg 31740 attaggagtc ctagttttaa ttggatttgt ggctagggac aaccatgctgtgagtataaa 31800 tagatgatgg gttgaagcta ctatatatct actttttggg agagaaaagttaggatttgt 31860 tttgaggctt caattctagt ttgtgagttg agagaggagt gctttagatgcactttgtaa 31920 acgcatattg atgtaataaa gttgatatat tttcaagagg ttcatcgatacgtgtttttc 31980 tggaccccaa cggtttgacc ggagttcatt gtgcggtcta accagcaggcggtcacaccg 32040 acggcaggga ggcggtccaa ctagtggtaa gtgggtagtt agaccgatggtgtgcaggcg 32100 gtcagaccgg tggaagtatt gtagtcagac cggatgtgat ctagcggtgtgatcgacgca 32160 acttcatcaa gtttgagagt aatcttttaa ttccccaaaa aaatttggatttggtgttcc 32220 taccattcaa ccccctctgg tagggttggt tgcttgtgtt cgatcttacatttacttgct 32280 cagagtaaaa ccacagatca tattcattct cactaatctc tagcaagggatgggagcaaa 32340 aataaaaaca atccaatggt ggtcgacgta acaagacagt ttcaaccactaggtatgaaa 32400 acagagcgga tagttgttga tggttgttaa gtgtaaactt taaccgtcaacttctgcata 32460 aaaaatatca aatatactat ccaaacttag tggtaggagt ttactactaaccacttccac 32520 tagttttggt aaatatatgt atgcaggtta attaaggaga aaatacacccgacatgtgac 32580 gaaatgaacc tccggccaat catgtgcgag tacaagggtt atattggagggcccacaagt 32640 cagtcgaaac caacttgaaa tgggaccaaa ctatcctaac cgccataatggcccactagt 32700 cagtctcatg gccaaagaag aggccagtag gtgggtccca gggttcgaccgaaccgcctt 32760 tagcgcctct ccatcccggc cttccacgtg gacgtaccag attgtctgccaatgacggta 32820 taggggtgtt aacaacattc ccaactgcct aatgactata tatgtttttcttttgcctaa 32880 tgactagttt ttttgtttta ttaagctatg ctatactact tacggttataacggttatag 32940 atgaatatag ttcttgttta ttatagctct tctgaaaagt gcagtgtacaactagttgta 33000 ttattgttaa ctgataaaac atagaaatgt tgatttaatg agctacatctcttcttcgtt 33060 gatatgagaa gcttggtttg agttgttttt atattctaat aaatatttgttactagaatc 33120 gctccatttt cgtatttgaa aatatttgat tatgtttttt atgtggggtttctgattcca 33180 attaaaaaaa atgaaaataa aaatggtacg ttcgttttcg agccgtttcatccctacttg 33240 cacatgcatg ctgggaacaa tggatctcac taatcaagta gcacgcagttctttatattt 33300 tgttgattat ggcttgtgaa gcatagattc accgaatcaa ttgggatgttaaaatagtca 33360 catacatacc taagcttgca tgacgttatg tagatagaga tggcgaatataatgcgctgg 33420 aaagtcccaa gtgaggatgc aaaacatctt atagtgggta gtggagccctgcaaggaccc 33480 taatatctaa agcgcaccta aactgatatg gacatccatt atagttaaagttagggggaa 33540 tatgattctc ttcctgtgca cctaaaccgt aatatgcagt gaaacgaacgctatgatacg 33600 atgataagct taattcctct ctctgctcag actgttcagt gcaaaagctaccaacgagag 33660 cttgtctcct tgtgcggtcg tgagcttgct tgtgctaagc ttgaagggagagtcgaacga 33720 atccatggcg gagacggtgc tgagcatggc gaggtcgctg gtgggcagtgccatcagcaa 33780 ggccgcctct gccgctgcca atgagacgag cctcctgctc ggcgtcgagaaggacatctg 33840 gtacgtactg cactgctctc gtttatccta gcaagttctt aggctcttaatctcgaaatt 33900 gaggaacacc atgaaacact aaaagagagc tcgaagacta ggaaagaaaactagaagact 33960 aagctttgaa agtcttctaa atccaagcat ctcgacattg atcatccttgtgcaacatca 34020 tcccttccta ttgcttcacc agaatcggtg tcccttgtgg agatctctgtcgtagcgtca 34080 aggggagaat ccgagaagca gaactagtcc gcgctgcctt cgctacgccatctccgccat 34140 agaggatctc atccacgaaa catccaccat ccaaacggga aactgttttaaacactcggg 34200 tggatattca cccgtttctt gcatgtcatc taaatggtta tgaaaaattttcaaaaaaaa 34260 aacatgatag gttaatatat aatatatcat ctcacaaata tgcaagttcaaattcaactt 34320 ttataagttg taagtataac aggacgttca tctcacaaat atgcaagtttaaatttaact 34380 tttacaagtt gtaagtgtaa cagtacgtcc atcggataga ttaatatccatctccccatc 34440 caaacccgtt gttgcaccat ctgtcgaatc cggctgtgga cgctcggaggcaagagctag 34500 ctcacccgtc ccacacacac acccaacgac gtcacaagcg cctccgaacaacgccaactg 34560 ataacttggc agctcctacg tgccgacgtc gcggtacttg ccggcgctcctagcgcacgc 34620 accgtcgaac cacaccgtca ccgaccaact acccaccgcc gccgacttctgcctcatctg 34680 ccatcgtcgc cctagcccaa gttatcatcg tggcaattgc cgaggctcctaagtgtgcca 34740 cggccgaggc aaagttctaa ctgaatcaga gtatcagaca gccaccaccgacgctacttc 34800 tgcttcatct gccatcgccg tactagttca agttgtcgct gtggcaatcatgggccctcc 34860 tagcgtgcca cacaaccgga cagccacgac atcccccatc actgttgttattgccgcgcc 34920 ctgaccccta tcgtcgtcgc tcttagcgcg tcgtcgagcc gaccagccactgtcgtgcag 34980 atgaaaaaaa aacacatatt ggcctgagag atctgcttag ttccagtgcaggtccaacat 35040 gctgtgagat gcgggcgtgc cagtcagttt gatcttgcaa ctgacaagatatataaatag 35100 cagataaaac agcctatcga ctaacaagcc gatggagtaa ttccagccgatagccgatat 35160 tagccgatgc cgattctagc cgatgtcgat agggttttga actatcggctatatgtccaa 35220 tgtaggtaat gatataaaga caattggctg atgataataa aatataaaaatataatccaa 35280 tagaaaccaa tcggctaata ataagtattg atccgatagt taaagcatacatcggctaaa 35340 agtccgatgt cataaaatcc aatcgattta gataaacagt gaaacctttgttgcaatcgg 35400 ctaaatccaa cttgtatgta atcttcgtaa gccgatgaac gtccagataacttatcggct 35460 agcacctcga taaaacacta gcatgaacct atcggcttaa caagatttatattatcaaca 35520 acaatctagt aggtcggacc taaccgatgc aacacgtatt agatatgataatctaatact 35580 tgatgagcca ataaatctgt ctaatgtgat ggatataaca aatctatttataaaagcatt 35640 gcgattgtag agatatatcg gctaagacag aatatcagac ctaactaaaccgatgcgtct 35700 ctaaacacaa tgcaattaat tagagatata attgagatat cagctaggcaaatatatcaa 35760 tcaaactaga gcgatccaag agatcggagc aatgcagcct tgaacaacaccaatgtagcc 35820 gatggattca ccagggccga cggaacgtag gacttacccc ttccctgaagatcgggctga 35880 accaatgcag tcccacgtca ggtgccaaat tccgccggtt gataagtaaaaccttagaaa 35940 agaggatgac gatgcgccga gagtagtatt gatcgagaga taaattgcaatgaccctgga 36000 tgtacatatt tgtacccatg ggtagatatt agttcttgta ggacaagaaagaaactttcc 36060 taaagataaa atgaaaacat aaagtcttta ttggatacta aacacactttcctaaagata 36120 aaaggaaact aaaccctgcc taattaatag ataaactgcc atgtcgtatcctccttgaac 36180 tcggactctt ttagataagc ttcctttaac taatctttac ccgaatccatcaagaataca 36240 aatgttggca ttgatagttt tcatcggtca attataggac tttgaagccgatactgactc 36300 taagccgatg actactttgg gcttaccaaa ttttgttgtt aatatgtcgcgaccaccatc 36360 accggccagc caccctgatc attgttgttg actcagcatt cgccaggctgagcagtccac 36420 atacatgccg ccatctccat ggcagtgtcg ttgccgcccc tttctcctagagccgccgca 36480 gcgctcttcg acacacctac tgcatcgtcg agcagtcgtg ctaccacctcctccatcgac 36540 catagccgcc tcttctgctg caccggatcc acccacacca accaccagatacagtcaagc 36600 cctcattcct ggatcccata tccatccatg ccactactgt gctgcccagtccaaggaatg 36660 gagcgaagga ggaagccccg ccgctgccct cccggcggcc acatgcactccagtgccttg 36720 ctccgacggc agcgaggttg gaaaatgggt ggcagcggct agggtttatctggggagaag 36780 gaaaaggaga gggagggggg gggagggtcc acttccagct taattagcctagatcttatt 36840 gacaaatcag ttgctgggtg cacaaacatg ttattttttt tgcatgaccaatcttgaaca 36900 cttaggtatg ttagttgagt ggacactggt ctatctgaaa catctcttcacatggaggct 36960 gcgaatgagt tttctttttg agagaccaaa gtttcgttgt atgttaagtgataaagcctt 37020 ggtaagaaat gctaccacaa acgaactaat aactccaaac gtaaagtggaggaacccgta 37080 tgggtgactc gagtggcgac aaactctagc acctccacct ccttggacgggctgcggcgg 37140 tgctttcggc atcccagtct tcttggaggc atcatctaga attaaggtcttgttattgct 37200 tagcatgcct tagggcacgt ccagtgttta gttcgactaa aacttccatgaaagccaaac 37260 aaaagttctg tttgaccacc acagtgtaaa aatcgattgt gggacccatgcaaaaaaatc 37320 acaatctcag ctgcctatgc tctcctcctg gacctgatag ccgtgcacaacaaatatttt 37380 tttaaactgg atgtgttcgg cttctcttta aagatcgttt tttcctctgacacttaccaa 37440 ccggctttca cagtgtggtc agttcttttt ttttttacgc aaagtttgattttagtcaga 37500 cacgggagga tctgttaagc aggcttggaa atttcggacc cctccaatacaatattattt 37560 tagccaaaat ttctaatttt ttaatttttc atgaattttg gtaatatttgttctaattta 37620 actaaatttt gttcaaaatt tcggtctatc agtgacctcc gatcaaatcagttaaaccga 37680 gaaaataaac catgctctta agagagtttg gtatggttca atatcaaaacttatagtctt 37740 gcaatttttt ctacccttta tctttttccc tgactattta gtatggatcgtttaaaaaaa 37800 agaaagccca ttggtgacca agggcttgtt tgattcaaga ccatccctagccttaccaac 37860 cttttggcaa tggcaaaaat tggttgttgc caaaaatatt ggcacaaattggctaagcct 37920 atgattggtt tctaccaaag ttgaattttg gcattcaatc aagccaaataatttggcaat 37980 aacattttct tatctatgga tataacatat ggcaaatatt ttggcattaccattttcttt 38040 ttgccaaaca tgttattcct tttgaatgac caatcttgac accttatgtatgttagtagt 38100 ggaatcgaca ctattctatc taaaacatct ctttacatag aggccgctaataatttttct 38160 ttgagataac caaattttcc ttacaagtta agcaacaaag cccattggtaagatatgcta 38220 cgacaaatga actaataact ccaaacgtaa agcggaggat cccgcatttcccacgtgggt 38280 gactcgagcg gtgacaaacc ctagtacctc cacccccttg ggtgggttgtggtggcactt 38340 tcggcaccgt attttccttg gacggatcat ttagaaagtc ctattattgcctagtatgcc 38400 ttgacagttt aggcaacact cttggatggt ggtgtccttt gccctggtgatctagtagcc 38460 catggatgtt tagttatttg gacatggtgt tggatggtgc gctcgtgggcctgttgtagg 38520 tctggtgcca accagtcatg cttagaaata gccggatagg tgcacagtgctagttcttta 38580 cttggtggtt tgtgcagcgc tatcgacatg tggtggtgtg ctttttctttgtccggataa 38640 taatctcata gggctatact cttgttattt tgctgctata ttattatgataacttggtat 38700 ggttcgtttt ttcttttttt ggaaaaacac ctagttgatc aagggcttgtttggttcaag 38760 tgcattccta atcttacctt ttcttttttt tttcaatggc aagaattgttcattgcaaaa 38820 aaaaaaagag ataaaaattg gctaggctta cgttttggtt cttaccaaagttgtactttg 38880 agaccaaata tatggcaaaa ttttggcata accttttttt tttttgcttggttgagcttg 38940 gtacaaacca atcagtcaca aaatagactg tcatgaatca cgcctactaaattcctttga 39000 accgaactag aatatatttg ctcttaaaag atttcttgat ttcaattggtaccatttact 39060 agtagaaact taaatttaaa ttttaaaaac aaaatcataa tattgttgttatggaaattt 39120 tagtcatttt agtacttttg taatatatga gttgggttat acttgagatatcctaaattg 39180 ctttaagatg aacaattgct aggtatatca aagatgagct aaaaacaatgcaggcattcc 39240 ttagagctgc tgaagttatg aaaaagaaag atgaactatt aaaggtttgggcagagcaaa 39300 tacgtgacct gtcgtatgac attgaagatt cccttgatga atttaaagtccatattgaaa 39360 gccaaaccct atttcgtcag ttggtgaaac ttagagagcg ccaccggatcgctatccgta 39420 tccacaacct caaatcaaga gttgaagaag tgagtagcag gaacacacgctacaatttag 39480 tcgagcctat ttcctccggc acagaggatg acatggattc ctatgcagaagacattcgca 39540 atcaatcagc tcgaaatgtg gatgaagctg agcttgttgg gttttctgactccaagaaaa 39600 ggctgcttga aatgatcgat accaatgcta atgatggtcc ggccaaggtaatctgtgttg 39660 ttgggatggg tggtttaggc aagacagctc tttcgaggaa gatctttgaaagcgaagaag 39720 acattaggaa gaacttccct tgcattgctt ggattacagt gtcacaatcatttcacagga 39780 ttgagctact taaagatatg atacgccaac ttcttggccc cagttctctggatcaactct 39840 tgcaagaatt gcaagggaag gtggtggtgc aagtacatca tctttctgagtacctgatag 39900 aagagctcaa ggagaagagg tactttgtta ttctagatga tctatggattttacatgatt 39960 ggaattggat aaatgaaatt gcatttccta agaacaataa gaagggcagtcgaatagtaa 40020 taaccactcg gaatgttgat ctagcggaga agtgtgccac agcctcactggtgtaccacc 40080 ttgatttctt gcagatgaac gatgccataa cattgctact gagaaaaacaaataaaaatc 40140 atgaagacat ggaatcaaat aaaaatatgc aaaagatggt tgaacgaattgtaaataaat 40200 gtggtcgtct accattagca atacttacaa taggagctgt gcttgcaactaaacatgtgt 40260 cagaatggga gaaattctat gaacaacttc cttcagaact agaaataaacccaagcctgg 40320 aagctttgag gagaatggtg accctaggtt acaaccacct accatcccatctgaaaccat 40380 gctttttgta tctaagtatc tttcctgagg attttgaaat caaaaggaatcgtctagtag 40440 gtagatggat agcagaaggg tttgttagac caaaggttgg gatgacgactaaggatgtcg 40500 gagaaagtta ctttaatgag ctaatcaacc gaagtatgat tcaacgatcaagagtgggca 40560 tagcaggaaa aattaagact tgtcgaatcc atgatatcat ccgtgatatcacagtttcaa 40620 tctcgagaca ggaaaatttt gtattattac caatgggaga tggctctgatttagttcagg 40680 aaaacactcg ccacatagca ttccatggga gtatgtcctg caaaacaggattggattgga 40740 gcattattcg atcattagct atttttggtg acagacccaa gagtctagcacatgcagttt 40800 gtctagatca attgaggatg ttacgggtct tggatcttga agatgtgacattcttaatca 40860 ctcaaaaaga tttcgaccgt attgcattgt tgtgccactt gaaatacttgagtattggat 40920 attcgtcatc catatattca cttcccagat ccattggtaa actacagggcctacaaactt 40980 tgaacatgct gagaacatac attgcagcac taccaagtga gatcagtaaactccaatgtc 41040 tgcatactct tcgttgtagt agaaagtttg tttatgacaa ctttagtctaaaccacccaa 41100 tgaagtgcat aactaacaca atatgcctgc ctaaagtatt cacacctttagttagtcgcg 41160 atgatcgtgc aaaacaaatt gctgaattgc acatggccac caaaagttgctggtctgaat 41220 cattcggtgt gaaggtaccc aaaggaatag gtaagttgcg agacttgcaggttctagagt 41280 atgtagatat caggcggacc agtagtagag caatcaaaga gctggggcacttaagcaagt 41340 tgaggaaatt aggtgtgata acaaaaggct cgacaaagga aaaatgtaagatactttatg 41400 cagccattga gaagctctct tccctccaat ctctctatgt gaatgctgcgttattatcag 41460 atattgaaac acttgagtgc ctagattcta tttcatctcc tcctcccctactgaggacac 41520 tcgggttgaa tggaagtctt gaagagatgc ctaactggat tgagcagctcactcacctga 41580 agaagatcta cttattgagg agcaaactaa aggaaggtaa aaccatgctgatacttgggg 41640 cattgcccaa cctcatggtc ctttatcttt attggaatgc ttaccttggggagaagctag 41700 tattcaaaac gggagcattc ccaaatctta gaacacttcg tatttacgaattggatcagc 41760 taagagagat gagatttgag gatggcagct cacccctgtt ggaaaagatagaaatctctt 41820 gctgcaggtt ggaatcaggg attattggta tcattcacct tccaaggctcaaggagattt 41880 cacttgaata caaaagtaaa gtggctaggc ttggtcagct ggagggagaagtgaacacac 41940 acccaaatcg ccccgtgctg cgaatggaca gtgaccgaag ggatcacgacctgggggctg 42000 aagccgaagg atcttctata gaagtgcaaa cagcagatcc tgttcctgatgccgaaggat 42060 cagtcactgt agcagtggaa gcaacggatc cccttcccga gcaggagggagagagctcgc 42120 agtcgcaggt gatcacgttg acgacgaacg ataggtcagt cactccctacatggcagctt 42180 aattaacttg tttctaattc tcttcttgtt cagtattagc catcaggtgagggcgatgat 42240 ttcaactcac ttttcatctc tctcgttttc ttaacctgac agcgaagagataggcacagc 42300 tcaagctggc tgacgatctc ctcccccatc agcgtcgtca tcagcgaacagatagggcag 42360 ggcttccctg cttctgcgtg cacctcactg ctctgactct gagggacatgatgatcaatg 42420 aggcttccag ttcccaaatg tgcgcctaac acactcagtc attcccatccaaggtatgaa 42480 ttgatcgatc tgatttattt cttgccgtga ggtgcaaatg tttgatgcttactttgatca 42540 gggtattctg tttcgggctg tgatatgcac aattactctc aattcgtgttttgatgctga 42600 gtttttattt ctcttcttac ccgtgcactc ttcatttcca tttcattcataacagaaaac 42660 aatttgattg cattgtggaa gggaatatga gatcagaaat cagatggttagttgtggttt 42720 tcttatttcg tttgctatga gcaccaatcg tttgctagaa tgtctgaaagatcttgtgta 42780 catatggtgg actgaacaat tgaacattac aagttatcat attttatattgttgctaacc 42840 gatcacatag tccagtagtc cagtagtaag atttttgtta agtttattgttactgaatat 42900 attgtttggc ctgcagttgt tatttctctc aaaacaaaat tatttggtagtctcaagtac 42960 aaaaagaaag acagatcaga caagttgttt tactctacta gtttcaaattgatcatctct 43020 gtttgttctt cattcatttt ctttctgtaa gagagtttgc taggatggtgatgtggtcat 43080 gtggatgatc aaatggacta catcagacgc atcacactgc tgcccaacctttacccactg 43140 tagacaaatg gagtgcaggt cctaaaccag gccagaagtt tgttcagtgttcttgttcca 43200 aaataaacat tctggatggc aggttatttc attataacat tcactcttatagctttctta 43260 gtcaaaacta caaataaggt ctcctaaaaa atgcatcgac gttgatatctgtgttttctg 43320 ccatgcagaa tgacttgctc tcaatggttg aagctgcaat cctccatgctcatttctggc 43380 tgaacccaaa ttggtggctt ggaggctgga gctgcatgac attagagataacaatggcca 43440 ctttttgttg ccatgggtga gggtacatgg atcatgccgc tgagcctctactccgagcaa 43500 gcagaaaact ggctggctgt agatcgagtt cgccatcgcc ctcaactttgtcgatgcgat 43560 ggtgatcatg gatagatgta tgtcacatag cgcaaagcgg agcctccaatgttcgcaacc 43620 gtactgtaaa tgtgggagcg gcgaatttct caaggggaca accatggaacagaggagatg 43680 gagcaaagta gttgctcttg ctatcatttc agagctcagg ctgatctctagctcagatag 43740 gagtacgaat tcttatgtgt gtgtatttga tctatgatag tacgttttaagagttgagcc 43800 agatattgct tgatcattac atgatgatat gtaaaaaatg gtattgacaaggagacgcag 43860 gaacagggtg gttcattgct ctgtttaatc ttaatcttgg agagctaggatggaaaactg 43920 agttggtggt tatatctcta ctacttttga tgttgtccca aactaatttgtttctactca 43980 tgttcgtacc taaaaaagga atattttagt gattacagaa cttaattttctctttaatca 44040 gattatttat cagtgggatt tttctttctc tagttctgta tgaaaatactttttttatcg 44100 tcaatcctcc taaaattttg tgatatcagt attttttttg ttttactggaatgggctgtt 44160 tcggtgctgc ttgcttggac ttgctgattc ctccctcttt cctatttataaactcatttt 44220 tcttcagttt tttttcctgt attttggttt tctttcctct acgtgactacacattttgaa 44280 tcgaacatgc tatgctctgt atatctgctt ggaatactta ttaaatgcataggccggcca 44340 tttggaatgt gcacttaaca gttgtttgaa cacttccatg gatttgtttcctcagttgtc 44400 ggacggctat ttaaacctga ttaagaattc catgtgcaga gacttgtactagcgtcgaag 44460 actttgcttc ggtgactcgg agtcaagtca aggccggcac agcgcaagccaataagctac 44520 gacgacgacg acgacaacca tgccgcgcta ttgcttgtaa actttgtctacaaaagccag 44580 ccgcgatgcc atccatccac tccttccttc cccattgttg ctcagccactcggcgtcgga 44640 ttttcctcta ccacaccagc tctaccaacc tttcccgaga ttagagaggaagaggaggac 44700 gccgaagccg ttgcatcaag cccaggacca ccgtcgaaca cctgctgtgcgcaccgctcg 44760 gcattccttc gcttcaccac cgtgtgtgca cagttcccac cagagtgtgcaaagcagcat 44820 aggtaagtca actccgattt tctgctgttc tttttttttt taagataaagcaggagctct 44880 gctattcaat taagcatgga agaatttttg ggtattttgt gtatattctggccttgttta 44940 gttcccaaaa atttttaccc aaaaacatca catcgaatct ttggacatatgcatggacca 45000 ttaaatatag ataaaaaaaa taattacata ggaggttttt ttccggtccttgagggaagg 45060 cagtaccata tcctagccgt tgattttgca tgatctaacg gctggaaaacctcggtaccg 45120 cgtggtaccg cgtttctgtg agagtaggta ccgatcagtt tttgaggtggaagggtatca 45180 ttgtaatttc gcgtcactta tcccgatcaa cctaaccgtg gacgctgctccttcgagctc 45240 gtcgctgcgg ggtgtatgcg acggcgggcg tggcgtgacg gcgatatgcgacggcgggat 45300 gcgacggcag atgcggcagt gaggaggcgg cggcctcgat gcggcggcggccgcggcgcg 45360 atggcgggat gcgatggcgg gcgacccaag ccgagatgac gatgacaacaagcatgcgag 45420 attgatcgat gcggccgcgg tgcgacggtg ggatgcgacg gcggcggccgcagtgcgacg 45480 gcgggcgacc caagctgagt gcgacggcgg ccgcgtccct cgtcggctcgcgggacgcgg 45540 cgggctcgag ggggacggcg gtggtggctg gacgcgcggc ggcaggggcggctggacgct 45600 tgatggtggc gggcggctgg atgcgggacg gcggcggctg gatgcgcgactgcggtggcc 45660 agctggatgc acgatggcgg cggcggcctc gattggcgac gacgacggcgggctggatgg 45720 gcgacagtgg cggccggcgg gatgcacgac ggcggcggcg gcgtcgattggcgacgacga 45780 cggtgggctg gatgggcgac ggcagcggcc ggcgggatgc atgacggcggcggcggcctc 45840 gattggcgtc gacggcgacg ggctcgatgc cggcacggat ggcctcgattggcgtcgacg 45900 gcgacggcct cgatgccaca acgaagagag gacatctgtt tttgttagtccgatcatacc 45960 cctactaaat caatggtcag attagattgg tacctcatgg tacctcctcaaggataggaa 46020 agatgctctt gctatgaaga acctccgagc tcagtcaatg tccggaaaataacttggata 46080 actacaagtg gaagccattt ttggttggtt ttctgcatga tctgtttgtccctgttttat 46140 tatatcctgt aaacttttca ttgcgcagtt tgattaatac atgtctatttgatattgcag 46200 ataaatacag ttgtagcctg aaagatagta cttacaatct attgcttgaaagtctaaaga 46260 aagttgttca gagaaggaag atcagatggc ggatacagta ctcagcattgcaaagtccct 46320 ggtgggaagt gctgtaagca aggttgcttc ggttgccgca gacaagatgatcatgctgct 46380 gggagtgcag aaggagatat ggtgagcatc tgacttgcag cctaattaattttattttca 46440 gttgcattag atttattggg accacactta tgcagagtgg tatggtactcagtttatttt 46500 taatgattta tttatatttt tataccatgt tctggaggaa tgcatatgcagttttttcta 46560 taagtatatt atttgcaaca tcttggggag ataaatgtag aggaaagtgaaagtagaatg 46620 cactggaagt ccctctatat acccaaacac aagtggacat agttttctcacggcaaccat 46680 gttcaatgaa ggaatacaaa cgaggcagct attaaggacc tggtgataatctaatttcga 46740 cagaaacatg gtttccttcg agtgtaaaca atgtagcatg gcatgcttaagtcaaatgca 46800 tatacaattt gacaaggaac tattagtttc agtgctgtgt aattttgctttttgtaggat 46860 cgaacagaaa taactaagcc aaccagagaa gggagggggg gggggtgaatagttgtagta 46920 ccaaaaacca aaatttttag cggaattaaa agttaccctc gaatcgataaattccgatct 46980 gaccaaagta gatacgccgg tctgaccgct tggatcccgt cggtctgactggagtatatc 47040 gtccggtcta accgcccgaa gaagctgaag tcgccgccgg tctgaccgccgtgtacccgc 47100 tggtctgacc gccgcgatgt cgccgatttg accgccggtg tcccaccggttagaccgccg 47160 aactcaagta aatacaaatt gaagatctct caaagtagat gacaactttattgcttctct 47220 ctatgtttac aaagtgcaac aacagcactc ctcacgaaaa tctcgactaaactcgaaacc 47280 ctaactattc tctcaactca atactctcta aagcgatacc gggaggccacaccctccctc 47340 tctatttata catagggtag gcagcctaaa gccacaaatc aaactcatgcaagaagtcct 47400 aatccacata ggaaaccttc ccgtacaaga aaccaacttt acaaactcaaatcataccaa 47460 atttagactc cttccaaatt tgactccaca tcctatacgc acacaatatttccattgtat 47520 gccatatgta atcttcacca accacgtgca tttattttta gcctaagtatcccgcatgat 47580 atctgacggt ccggacgtca ccttatctcc aagttgactc ccgatccatcgccgacaata 47640 ctctcccgag gcatcaaaac acctacacat gaatcaaaca aagaaaccatattccaagac 47700 caagctatct ccaacttgac tcattattag caaacaacag tattacatacgcatagtatc 47760 catctagaag ttataagcat gaaacatcca cagatataaa aaaaacaacccgaaaccgaa 47820 accaacacag agtcggccga tcagaccgcg ggctggccgg tctgaccactcacataactc 47880 tggtctgacc ggcaacccat gcccggtctg accggaccaa aactctagtagcacatgttc 47940 atcacctgca aatccaatca tctccaaaat cacttcacca ataatctcctattatcaaaa 48000 ccagtaatct cagatgccaa ttgttcatca tagaataaga atgaaacacactttgattta 48060 cacttttcac cttggaatca aagatttaat gcaaattaag cttcacgttcctagtttgag 48120 cttccaattt gtacatattt gtgatttata atgctatata tacatgtatacagacaagta 48180 acgcagtcca cgtgggatgt tggccatgcc tggataagct gagcaagatttagacctgtg 48240 tcaaaacaat gtcttgaatt ccgttgaact ttttgtttac aggttcatcaaagatgagct 48300 acaaacgata caagcatttt tgattgctgc cgaagcatca aagaaaagcatactattgaa 48360 ggtttgggtg cagcaagtaa gggatctttc ctatgacatt gaagattgccttgatgaatt 48420 tacagttcat gtgggcagcc aaaacttgtc gaggcagttg atgaagctaaaggatcgcca 48480 tcggattgcc atccagatcc gcaatctcag gacaagaatt gaagaagtaagcactaggaa 48540 catacgctac aacttaatag agaatgacct cacctgcacc actactgatgagaggaattt 48600 atttatggaa gacattcgca accaataagc taacaacatt gaggaagctgatcttgtggg 48660 tttttctgga cccaaaagag agttgcttga tcttatagat gtccatgccaatgacggacc 48720 tacaaaagtt gtatgtgttg tcggtatggg tggtttgggt aagactactattgcaaggaa 48780 aatttatgaa agcaaagagg acattgcaaa gaatttttct tgctgtgcttggattactgt 48840 ttcacagtcc tttgttaggg tggaactact caaggatttg atggtgaaactttttggaga 48900 ggaagtactg aagaagcggc tgagagaact cgaagggaag gttccacaagtagacgacct 48960 cgccagctac ctcaggacag agttaaatga aaggaggtac tttgttgtgcttgataacgt 49020 gtggagtaca gattcatgga aatggattaa tagtattgcc ttccctagaaataacaataa 49080 agggagccgg gtgatagtaa caacaagaga tgttggctta actaaggagtgtacttctga 49140 attgcttatc taccagctta aacccctaga aataaactat gcaaaagagttgcttctacg 49200 gaaagcaaat aaagcaatag gagatatgga aagtgataaa aagatgagtgacattataac 49260 taaaataata aagaagtgtg gctatttacc gctggctata ctcacaataggaggcgtgct 49320 ttccaccaaa gagataagag agtgggaaac tttttatagt cagataccttcagagcttga 49380 gagcaaccca aaccttgaag caatgagaag gatagtgacc ctaagttacaactacttacc 49440 gtctcatctt aagcaatgct ttttgtatct aagcatattt cctgaggattttgaaattaa 49500 taggaaccgt ctggtaaata gatggatggc agaggggttt attaaagctaggactaatat 49560 gactattgaa gatgttggga aaagttactt taaagaactt atcaaccgtagcatgattca 49620 gccatcaaga gcgggtatac gaggagattt taagagctgt cgagtccacgacatcatgcg 49680 tgatattaca atttcgattt ctagagaaga aaatttcaca ctcttacccgatggcactga 49740 ctatgatgta gtacatggga acactcggca catagaattt cacgggagtaagtattgctc 49800 tgaaacaagc ttggactgga gcattatacg gtcattaact atgtttggtgagaggtccgt 49860 agaactagag cattcagttt gttcatctca gttgaggatg ttacgggtcttggatctaac 49920 agatgcacaa ttttctatca cacaaaatga tgtcgacaac atagtgctcttgtgccactt 49980 gaaataccta tgcattaaga tacagatacc gttcaccata tatttattcacttccacaat 50040 ccatagctag actgcatggt ctgcagacat tggacttggg tcagacgtacatttcaacac 50100 tgccaactca gattactaac ttcggagtct ccgtagcctt cgatgcatgaaagaatattt 50160 ttcttcttct ttaacaacat atttaactaa cacattatgc ctgcccatgatattcacacc 50220 tttcgttagt acctcggatc gttctgaaac aattgctaaa ttgcacatggccaccaaagg 50280 cttccgttca aaatcaaatg gtgtcaaggt accaaaagga atatgtaagttgagagactt 50340 acaagaggat tgctacggtc cagcaggttg taccgggcgg tactggtaccgcgcggtacc 50400 aaaacccatc taaccgttga atccgggatg ggtaggatcg ggagagaaaagatgagcaag 50460 ggtggatgag ggagtacctg tttcgtgtcg tcgttcccgg cggcggcggtgtggagtacc 50520 tgtttcgagt cgtcgttccc ggcggcggcg gcgtggagta cctgtttcgagtcgtcgtca 50580 ttcccggtgg cggcgcagag caacaaggga cgccggcggc gcgggagaggataaagtccg 50640 gcggcggcgc gagagagaaa aaagggaacg gcgacggtgc gggagaggaacaagggaagg 50700 acggcggcgg cggaagagga acaagtccga cggcgaggaa gaggaacacggcggcggcga 50760 aaatcatcca gcgtagctag ggttcgagcc gcccgatcca aacccatctattgcacgcga 50820 agttactctt ttacccttcc aactctcttc tccatgcggt atcacctaagggacattttt 50880 ggtaccgtgc ggtaccacgc aacatcagcc gttggatcag gccagatccaacggccagca 50940 tttggtaccg ctcggtacgt tggacagtaa aaaaactcga cttacaaatattggaggtag 51000 tggatattag aaggactagc agtagagcaa tcaaagagtt ggggcagttaagcaagctga 51060 ggaaattatg tgtggtaaca aagggatcca caaaggaaaa atgtgagatactctatacag 51120 ctatccagaa gctctgtttc ctacaatctc tccatgtgaa tgctgtgggattttcaggta 51180 ttggaacact tcagtgtata gattctattt catctcctcc tcccctactgaggacactca 51240 ggttgaatgg aagtcttgag gagatgccta actggattga gcagctcacgcacctgatga 51300 agttcaactt atggaggagc aaactaaaag aaggtaaaac catgttggtacttgcggcgt 51360 tgcccaacct catggtcctt tatcttcatt ccaatgctta ccatggggagaagctagtat 51420 tcaaaatggg agcattccca aatcttagaa cattttcgat ttacaatttggagcagctaa 51480 gagagattag atttgaggac ggcagctcaa tcttgttgga aaagatagaaatattcaggg 51540 gttggaatca gggattgttg gtatcattca ccttccaagg ctcaaggagatttcacttgg 51600 atacggaagt aaagtggcta ggcttggtca gctggaggga gaagtgcgcacacacccaaa 51660 tcaccccgtg ctgcgaatga gggaggaccg aagtgatcac gaccttgcttgtgacgccga 51720 aggatcccct gttgaagtgg aagcaacaga tcctgtgaga gctcacagttgcaggtgatc 51780 acgttgacaa cgaacgacag gtcagtcact ccctacacgg catcttaatgaacttgtttt 51840 atcctcttgt gagatcgatg attttaactc accctttcat ctctctcgttttcttaacct 51900 aacagcgaag agataagcac aacttaagct ggtttgatca agtgatgatctcctcctcca 51960 ttggcatctc cggtcgtccc tgcttctgcg gctgcgcacc tcgctgctccgaggaggggt 52020 gctgatctaa ggaggcttcc actttcttca attgcgtctc atgctctcgattcttccctc 52080 tcgggtatga attgttcaat ctgatatttt ctcgcgatct gctactggttccagcatgag 52140 catttgaacc agcagcttag aattatcgtt tgatcaggtg ttatttatcccttcttacct 52200 gggaactcta cttatccatt tcattcagaa cagaaaccat gtttattacactatagaggg 52260 gaacaacaga tcaggcacga gttgtggttt tgttatttcc tttttggtgtgcacaccagg 52320 tgattgctag aatgtctgaa agagcttgtg tgcatggttg gctcaacattatgtgctaat 52380 cactctttta taccgttgct gacctatcac atagtgcagg agtaaaattttgctatgttt 52440 attgttactt aatgttgatt ttccttttcg taaagaatat taactttttagataacaaaa 52500 agaatattat gtatcctgca aaaaggtttg atttggtgat cactggctagtacaaaagta 52560 acgagagatc agaaaaaaca aagttttggt tacactagct cattacaatttatttgctga 52620 acatgtgaaa aattaataat tgtcgtacgt catggttctc tggaagtctggctgcaggtg 52680 gttagttttg tcatatccat tatcttggtc taccttttcc tactaaactaggactatgag 52740 tagaaaaaag taattgttgg ttccaagttc aaaagatagg cagatgtgagctaaggaact 52800 tgcagttctt gatctccttc agagcttctc acctcacata aatggatctctgtttatccc 52860 aaagcaacat tttttaacct gctagttcca aattgatcat ctttgttattcttcatttat 52920 tctatttctc tccctaaggg agttttctag ggtggatatg tggaaaatcgaatgcagcta 52980 tgctgttcaa accactgtca aaagatgcag gacaggatgg gcagatgtttattcagtgtt 53040 cttgttcaaa tgaaacatat gctattctgg aagaggttgg ttaacttacattgtatataa 53100 cccttacctt gttactatcc tcttgcaaaa tgcatggatt ggtgaaatcatgttttttgt 53160 catgcagaat tgttttctga agctcaaagt tgaagctgca gtcttcactgtttctggcag 53220 aacccaaatt ggtggcttgg tgctgcataa cctgcagaga acgcaacggccactgcttgt 53280 agccttcaag gatgctacat tattgatcat ctcactacgg tcccgatcaaacttctgatc 53340 gagttcgcca ccgctgtgga ctttgcatat gttgatggag tggtgcgtatgaatccaggg 53400 ggcagagcca ctgccctgcg accttggggc ttgtcaacga gcatacagtataatattttg 53460 ctgttctttc agtgattaaa aacgaaaatt ttaaagaaaa cattgaatgcaaattagatc 53520 tgcccaggtt ctcaaaattt ctttagctcc gccactgctt gaatcgatgtcatatgctgc 53580 aaattaagtg gattggagat gtgcatatgc tcgaatgttc tcggttatattgtaattgtg 53640 gggtagcaac tttatcttgg gtacaaccat ggaacaaggg tgaaattgtactaattctta 53700 tttgtgtaca attgatatat ctcattactt cttgcattct gttagtcatatgtatttcca 53760 tacatcgttt gcacctgcta tggctgcttg aggatatggc aaagcttaaaagatgatgtt 53820 aacatggtca catggagatg cagggcactc cattttctgt ttttctctcaatcttatgta 53880 gagttaatat atggacagta gcaatagtta catctttctg tcaactaggcatactaccca 53940 cgtgttgctc cgggtctttt tgggatggtc tctattaaga tgtaaacttatttattaaat 54000 gaattcaatc gcgtcgcatt ggttcattta tttagacttg gttgtgatatactcctgttg 54060 taaaatataa taacttttac gattctagca ccatttataa tatttataagagtacctgtc 54120 tcagcaatca tcaatcattt tctatttagt tctttctatc ttacccttacatacctttca 54180 acactcatcc attcctctgg tgaataacta aaaatgttta aatttcagagcggaggtagc 54240 aataagttct agtaaaagct gttgaatagt cccacattgg ttgtaaaaggacaaatgacc 54300 taacatataa gtgggtgagc cctgtacctc attagctagc tttttgggtgaggtcccttt 54360 acgatcttat aattggtatt agagcctggc tagtttgaca tttacccgagggcactacta 54420 gaaaaaggac cgaccgccgc tagaccaaca aaggaacata gacgagatcgccccaaaaaa 54480 agcccccaca accaacacaa agcccaactc ctaaagcgtg cttgcaccaatcgttcgaga 54540 gatttcggct aggggatgcc aaaatgacgt cttcgagaaa agaagcgatggaaaaccgcc 54600 gccgccgtct gtcggggctc aaaggagcca agactgggct ttcgcccggcaaccaccctt 54660 gagggataag acatcacgac aacgccctca ggaagtcagg agggggaattaaccatcgtt 54720 gtcggtccgg ccaaggccgg gctgggtttt cacctgccgc tcaccacctgcgaatccacg 54780 gctgacgcac cgatgctcca ccaccactca acctctgccg ccatgtgggaccactgcacc 54840 ggcgccccct gtcagccaac cttcatgcgc cgaagaccgt gccacacccaccgacagcac 54900 ctcctcgcac tgagactgcc tcctccacta ccgcccaagc ctctcgcgccaagccggcct 54960 tctctactgg acgcgcctct cgcgccaatc caaccttcct ccatcggccgcgcctctcgc 55020 gccaagccgg cctccatctc ctccgcccgc gcctctcgcg ccgagccagcctccgctgcc 55080 agcagttgcg cctccctgca ccaagccggc tttcgacccc tcctccaaaggctaccgcac 55140 cgaccggata cggccgtctg ccacgccccc ggctagccgt ccgagaccgccatgcctccc 55200 cctatggcgg tggcgatcgc caccaactag ggttgaaagt gattcggataatttccgtcc 55260 gaccggacct tttttcggat tcggatagtg tcggtcggat atattcggaaatttgaattt 55320 gaaatcatga caacttcaaa tagcattttt aaatactaaa tgatttcaactgaaaaagtc 55380 atcaacaaca aagttgtata actcatcaag atttataact tttattttggtcatttcttc 55440 atccgataaa gtgatagtaa tattattcac aaaatttaca tctctcattaggttttatga 55500 actataagag agatatataa attttatgaa caatgttact attactttgtagaacataaa 55560 agttgtagaa ctcatcaaga aataaacctt ttattttggt catttttctaaaagtttgaa 55620 tttgaatttg aaaatatgac aacttcaaat aatattttca aatacttaatgatttcaact 55680 gaaaagtcat caacaacaaa gttgtataac ttatcaagat ctataacttttattttagtc 55740 attttcttcg taagacaaat tgacactaac attgttcaca aaatttacatctcttatttg 55800 gttttatgaa ctataagaga gatatataaa ttttgtgaac aatattactatcactttatc 55860 agatgaagaa atgaccaaaa taaaagttgt attccttgat gagttctacaacttttatgt 55920 tcgtgacttt ttcagctgaa attatttact gcttcaaaat atcatttgaagttttgaaat 55980 tcaacttttt aattgataaa acaaagtcac aagaaaaaat ggccaaaataatagcagtaa 56040 aaacacaata acatgataga gcatgatttt agaaacattt aggaaaaagaatcatccaat 56100 ttggagttca tatgagtgag ataaactagt ttcaaatttt taaattttattttcgcatac 56160 ggctccttaa gacgtccgta tggaaaaatt gatttttcca cgcgggctcttaagttgtcc 56220 gcacgcaaaa tgagctcatt ttggcgtctt gaggagtcgt atgcgaaaatgccgacgcgg 56280 caagttgtga tccatttgga aaaatcatag ggtctcgtac aaaagaaattgtttgtgtag 56340 tagcgagggg tttttatatt ccgattaata ttcatcaccg tattcgttttgctccgtatt 56400 tgtattcgat aatattccat ttcgttttta tatccgggtt tccagttccgaaaaaaaaag 56460 aaagtgaata cgatagagct agtttccgac catattcgat ccgttttcatccctaccacc 56520 accgcagcca ctactaccct tccatccccg ccgccatctt gccatcctcccgcaccttct 56580 cgcagtcatc gagctccgac ggcacacggc gcagacggcc caccgtggtagcccgcactg 56640 ccgtcgccac gaactcctcg ccaccaccgc ctcgaccgcc agactccttcgggcgctggg 56700 tctgccgtcg gcgcggccag gttcgcctca ccgacgccat cccctcgccacccccaccac 56760 cagacgctgc cgaaggctgc catcccctcc ccttgcctcc cctgccgccatcgccatccc 56820 cgccactaga cgccgccgcc ggccaccatc ccgccagatc caggtgcggatctggcggtt 56880 tcctccgtcg ccgtaaacgc ctcgaacgcc gctgccacca ccaccagacaccaccgccgc 56940 accgctcagc cctgctgcca gctgccccat cgccagatcc ggccgggcggcacagatctg 57000 ggctgttctg ctgccccgag caggcccccc tcctatgccc gagcacgaggatgaagcccc 57060 gccgccactg tctttgtggc cgcgcgactt tgccggcgac tgcttgggcagcggcgaggc 57120 agaggaggga aagggagatg agcaccggcg aggtcgtcgc ctcccagttgcccgtgggga 57180 ggggcgacac gagaggccaa gcgctactca actgcctgat gctcatccaaagtgagaaag 57240 atgcttgaag ctgtcgctcg aagcaacttt cagtcctcga tataattcgatataagtgat 57300 ttctctctcc atatttgttt ggagaaatgc tagttataag aaactaagtgtgaccatgtg 57360 ttatagatgt cagagaaaac agttcatttt ctcaattctc aaggtaattgggaaaatgga 57420 gaattactga gcgatacgtg ttgctggaaa attgagaatc actgatgatcgcctccatct 57480 gaaaattacc gagcgatatt tattgctgcc aaatcaaaat gattctattcaacccgtgcc 57540 atggcatggc cggatggccc cagttcactg attgatcact taatatatggactgaacaat 57600 aaatcatggt actttggttg tcgagcatga tcattatttg tggaccacagattcaccgaa 57660 ttaattggga tattgaatcg gtcacacaca cgacgggcgt actccgctcgtcttaaaata 57720 aatggattgg acgtgacatt atctactaca acgaatctgg attagatagtgtctcatcca 57780 atcctaaatt ggtttatttt gtgacggagg gagtatgatg ttaacgtagatggaaatgag 57840 gaattgagta gacagtgtgg ggctggaaaa taatggagga cagtaacatcttggagtgta 57900 gtgaggcctg gagggtcgtc cttgacatcc aaaccgcacc taactctatgataagcatcc 57960 tctctcagat tgttcagtgc aaaagctacc aatactgctc cgagagccagaaaaaagcgc 58020 tggtcgccta gtgctatctt ctatgcagtc gtgagattgg ttgctctaagcttgaaggga 58080 gagtcgaacg agtccatggc ggagacggtg ctgagcatgg cgaggtcgctggtgggcagc 58140 gccatcagca aggccgcctc cgctgctgcc gacgagacca gcctcctgctgggcgtcgag 58200 aaagacatct ggtacgtact gcgtgactct cgttaattta ttctgtagatgctcaggaat 58260 cagcaactat tgtgttgatt tccatcgtag catatcgatt ttgttggccaccaattctaa 58320 tcggccggaa caagctagtc actaaatctg gcaaatcgat cagctgctgagtgcacaaac 58380 atgcatgtta ttcttttttt tttttgggtt atatgttaag caacaaagccccttggtaag 58440 atatgcatgg caaatgaact aatatcgaca tacgtaaagc ggaggacccctcgttccatg 58500 cgtgggtgac tcgagcggtg acaaatccta gcacctccac ctccttggatggcctgtggt 58560 gacgctttcg gccccgagtt tcccttgaat acatcatcta caaggtgctattaatgtcta 58620 gtcacatcat ttacgaggcg ctattattgc ctagtctgcc cgaagatagtttagacaaca 58680 ctcttggatg gcggtgtcct tcgcccagtg atgtccaaga gcccgtggatgtttagttgt 58740 ttagacatgg tgttgggtgg tgcactagtg ggcctgatgg gccagttgtaggtccagtgg 58800 taaccaatca tgcttagcaa tagccagatg cccggattgg tgcttgttcttttttcggtg 58860 tcgacgcatg gtagtattta cttttcctgt ttttcctgat tatagcatcctaggctatac 58920 tcttctaatt tattcatgct atattaatat taaaacttgg tatggtttgtttcattcaag 58980 acccttggtg gtcaaaggct tgtttggttc aagttcattc ctagccttaccaactttttg 59040 gcaatagcaa gaaatggtca ttgaaaaaaa aaaggcaaaa attggctaggcctacagttt 59100 atttcctagc aaagttatac tttagcattc cactaagcca aataattcggcaatgccatt 59160 ttcttatcta catgccaaat atatggctaa tattttggca ttaattactcttattttttt 59220 tggcaaaatt gatcaaaagt tcacattttt agctctatag tatcaaaagttatctattca 59280 ctttaataga ccgaaagttt actcggttcc cgttttcagc actaccgtctcttttctctt 59340 gatttgccgt caattttgac cggcagtcct acccccagga gacattgagcagcagcccgt 59400 gatccccctc tctcgccgcc ggtgacgctg tggtggcata cctgctgcgggcagaataag 59460 tctggcgtca tcgccctatc gcctggagct gcaaccacca ctaccgggcccatcgatcgt 59520 ctagagcgtt atccaccctg cctgccccat tacttgcagc tccggctgggtcagaacctc 59580 tccatgaatt ttcagaatat gccatcgaat acgcgctgct tttaagatatgctacctgat 59640 tcgtgctatt ttcagaatac gctattggaa cacgaatttt cttcgttccgtgccactccg 59700 tctctcagag tcagtcgtgc cgtcgtcatc cgtctgccta gcactgtcgtcgtcagtccg 59760 ccacccgtgc ctgactgccc gttcagctgc gccgtcgtcc gtccgtcgccgccattgccg 59820 tcgtccactg tcgcgcccgc acctgcaccc gtgtcaggca cgccgtcgtccgtccgctgt 59880 caccatcatc gtcgtccact gcggcgagcg cagacagctg tggacggatgattgcgcggc 59940 aagcgcaagc ggcggtggac ggacgacggg gcggtgcacg cgagcacggtagccgatgga 60000 ctgacgttgg cggcattggg agatggacga cgacggcatg attgacgtggggaacggaat 60060 gtcacggaac ggagaaaatt cgcactccgg tggcgtatat tccgaaaatagcatgaattg 60120 ggtggcatat cctaaaaaca gtgtctattc ggtggcatat tctaaaaattctcgatcagc 60180 ataatcccca tcaatcccca atccctcaac agttgggtta atattcctggagatgtgttc 60240 ggttgtttag gttgaagttc tccacttcac ctccatgagt acatgcacctctacacgtac 60300 gttcttaatg agtttgtttg ttctatcctc cgcttgggtt ctattttgttggttccgatc 60360 tgatttgatc tggagcgggg tcgatcttcc acgacggcga gagagacgttgttcgggctg 60420 ctcgatttgg ttcaactgtt taggtcgaag ggaggggtag aattgcaattcaagtgcatg 60480 gtcagtcaat ttgggtcaaa attaacatca aactgggata aagagacgacagtgccaaaa 60540 ttggtaaccg gaaaactttg agttctatta aagtgaacca gtaactttcgttgctataga 60600 ataaaaacgt aaacttttga tgctccttgg ttgagcttgg tacaaaccaaacagacgtaa 60660 aataaacact atcctgaatc aagtctacta agttccattg aactcaaccaggatacgtac 60720 acttcctctt agaagatgtc ttgttttcac tttgtaccat tttttctattgtaaatttgg 60780 tacctcgttg tacctaggta caagaggtac catgaggtac cattttttctattgtaaatt 60840 tagtacctcg tggtacctag gtacaatgag gtaccaaatt ttacactaaaattttggtac 60900 cttatgatac ctcctcaagg accgtagaat tgctcttaat ttaattttaaaaaaaacata 60960 atatttttaa agcatattat ggaaatttta gtaattatta cttttgtaatatatgagtta 61020 cggttatact cgagataccc taaattgctt ggagatgaat aattacaaggtatatcaaag 61080 atgagttgaa aataatgcag gcattcctta gagctgcaga agttatgaaaaagaaagacg 61140 aactattaaa ggtttgggca gagcaaatac gtgacctgtt atatgacattgaagattccc 61200 ttgatgaatt taaggtccat gttgaaagtc aaaccctatt tcgtcagttggtgaaactta 61260 gatagcgtca ccggatcgct agcccgtgga tgtttagttg tttgcacatggtgctggatg 61320 gtgcgctcat ggtcttgttg taggtctggt accaaccagt catgcttagaaatagccgga 61380 tcagtgcacg gtgctaggac tttacttggt ggtctgtgca gcgctatcgacatgtggtgg 61440 tgtgcttttt ttttttccgg attacaatct catagggcta cactctagttattttgctgc 61500 tatattaata tgaaaacttg gtatggttcg tttcttttag aaaaaaacctagttgatcaa 61560 gggctagttt tcttcaagtg cattcctaat cttagcttct ttttttttttgcaatggcaa 61620 gaattgttca ttaaaaaaat tgataaaaat tggctaggcc tacgttttgtttcttaccaa 61680 agttgtactt taacaataaa ctaaggcaaa tatttcggca atgccattttcttgtctaca 61740 gaccaaatat atggctaaat tttggcataa ccattttttt gtttgcttggttgagcttgg 61800 tacaaaccaa acagacccaa aataaacagt gtcatgaatc acgtctactaaattcctttg 61860 aactgaacta gaatatagtt gctcttaaaa gatttcttga tttcactcggtaccatttac 61920 tagtacaaac ttaagattta atttttaaaa ataaaatcat aatattgttattatggaaaa 61980 tttagtcata gtacttttgt aatatatgag atgggttata cttgagatatcctaaattgc 62040 tttaagatga ataattgcta ggtatatcaa agatgagcta aaaacgatgcaagcattcct 62100 tagagctgct gaacttatga aaaagaaaga tgaactatta aaggtttgggcagagcaaat 62160 acgtgacctg tcatatgaca ttgaagattc ccttgatgaa tttaaggtccatattgaaag 62220 ccaaacccta tttcgtcagt tggtgaaact cagagaacgc caccgaattgctatccgtat 62280 ccacaacctt aaatcaagag ttgaagaagt gagtagcagg aacacacgctacagtttagt 62340 caagcctatt tcctctggca cagagattga catggattcc tatgcagaagacattcgtaa 62400 tcagtcagct cgcaatgtgg atgaagctga gcttgttggg ttttctgactccaagaaaag 62460 gttgcttgaa atgatcgata ccaatgctaa tgatggtccg gccaaagtaatctgtgttgt 62520 tgggatgggt ggtttaggca agacagctct ttcgaggaag atctttgaaagcgaagaaga 62580 cattaggaag aacttccctt gcaatgcttg gattacagtg tcacaatcatttcacaggat 62640 tgagctactc aaagatatga tacgccaact tctaggtccc aattctctgaaacaactctt 62700 gcaagaattg caagggaagg tggtggtgca agtacatcat ctttctgagtacctgataga 62760 agagctcaag gagaagaggt actttgttat tctagatgat ctatggattttacatgattg 62820 gaattggata aatgaaattg catttcctaa gaacaataag aagggcagtcgaatagtaat 62880 aaccactcgg aatgttgatc tagcggagaa gtgtgccaca gcctcactggtgtaccacct 62940 tgatttcttg cagatgaacg atgccataac attgctacta agaaaaacaaataaaaatca 63000 tgaagacatg gaatcaaata aaaatatgca aaagatggtt gaacgaattgtaaataaatg 63060 tggtcgtcta ccattagcaa tacttacaat aggagctgtg cttgcaactaaacatgtgtc 63120 agaatgggag aaattctatg aacagcttcc ttcagaacta gaaataaacccaagcctgga 63180 agctttgagg agaatggtga ccctaggtta caaccaccta ccatcccatctgaaaccatg 63240 ctttttgtat ctaagtatct ttcctgagga ttttgaaatc aaaaggaatcgtctggtagg 63300 tagatggata gcagaagggt ttgttagacc gcaggttggg atgacgactaaggatgtcgg 63360 agaaagttac tttaatgagc taatcagccg aagtatgatt caacgatcaagagtgggcat 63420 atcaggaaaa attaagactt gtcgaatcca tgatatcatc cgtgatatcacagtttcaat 63480 ctcgagacag gaaaattttg tattgttacc aatgggagat ggctctgatttagttcagga 63540 aaacactcgc cacatagcat tccatgggag tatgtcctgc aaaacaggattggattggag 63600 cattattcga tcattagcta tttttggtga cagacccaag agtctagcacatgcagtttg 63660 tctagatcaa ttgaggatgt tacgggtctt ggatcttgaa gatgtgacattcttaatcac 63720 tcaaaaagat ttcgaccgta ttgcattgtt gtgccacttg aaatacttgagtattggata 63780 ttcgtcatcc atatattcac ttcccagatc cattggtaaa ctacagggcctacaaacttt 63840 gaacatgctg agaacataca ttgcagcact accaagtgag atcagtaaactccaatgtct 63900 gcatactctt cgttgtagta gaaagtttgt ttatgacaac tttagtctaaaccacccaat 63960 gaagtgcata actaacacaa tatgcctgcc taaagtattc acacctttagttagtcgcga 64020 tgatcgtgca aaacaaattg ctgaattgca catggccacc aaaagttgctggtctgaatc 64080 attcggtgtg aaggtaccca aaggaatagg taagttgcga gacttgcaggttctagagta 64140 tgtagatatc aggcggacca gtagtagagc aatcaaagag ctggggcacttaagcaagtt 64200 gaggaaatta ggtgtgataa caaaaggctc gacaaaggaa aaatgtaagatactttatgc 64260 agccattgag aagctctctt ccctccaatc tctctatgtg aatgctgcgttattatcaga 64320 tattgaaaca cttgagtgcc tagattctat ttcatctcct cctcccctactgaggacact 64380 cgggttgaat ggaagtcttg aagagatgcc taactggatt gagcagctcactcacctgaa 64440 gaagatctac ttattgagga gcaaactaaa ggaaggtaaa accatgctgatacttggggc 64500 attgcccaac ctcatggtcc tttatcttta ttggaatgct taccttggggagaagctagt 64560 attcaaaacg ggagcattcc caaatcttag aacacttcgt atttacgaattggatcagct 64620 aagagagatg agatttgagg atggcagctc acccctgttg gaaaagatagaaatctcttg 64680 ctgcaggttg gaatcaggga ttattggtat cattcacctt ccaaggctcaaggagatttc 64740 acttgaatac aaaagtaaag tggctaggct tggtcagctg gagggagaagtgaacacaca 64800 cccaaatcgc cccgtgctgc gaatggacag tgaccgaagg gatcacgacctgggggctga 64860 agccgaagga tcttctatag aagtgcaaac agcagatcct gttcctgatgccgaaggatc 64920 agtcactgta gcagtggaag caacggatcc ccttcccgag caggagggagagagctcgca 64980 gtcgcaggtg atcacgttga cgacgaacga taggtcagtc actccctacatggcagctta 65040 attaacttgt ttctaattct cttcttgttc agtattagcc atcaggtgagggcgatgatt 65100 tcaactcact tttcatctct ctcgttttct taacctgaca gcgaagagataggcacagct 65160 caagctggct gacgatctcc tcccccatca gcgtcgtcat cagcgaacagatagggcagg 65220 gcttccctgc ttctgcgtgc acctcactgc tctgactctg agggacatgatgatcaatga 65280 ggcttccagt tcccaaatgt gcgcctaaca cactcagtca ttcccatccaaggtatgaat 65340 tgatcgatct gatttatttc ttgccgtgag gtgcaaatgt ttgatgcttactttgatcag 65400 ggtattctgt ttcgggctgt gatatgcaca attactctca attcgtgttttgatgctgag 65460 tttttatttc tcttcttacc cgtgcactct tcatttccat ttcattcataacagaaaaca 65520 atttgattgc attgtggaag ggaatatgag atcagaaatc agatggttagttgtggtttt 65580 cttatttcgt ttgctatgag caccaatcgt ttgctagaat gtctgaaagatcttgtgtac 65640 atatggtgga ctgaacaatt gaacattaca agttatcata ttttatattgttgctaaccg 65700 atcacatagt ccagtagtcc agtagtaaga tttttgttaa gtttattgttactgaatata 65760 ttgtttggcc tgcagttgtt atttctctca aaacaaaaat aatttgttagtctcaaatac 65820 aaaaagaatg acagatcaaa caagttgttt cactctacta gtttcaaattgatcatctct 65880 gtttgttctt cattcatttt ctctctgtaa gagagtttgc tagggtggagatgcggtcat 65940 gtggatgatc aaatggacta catcagacgc atcacactgc tgcccaacctttacgcactg 66000 ttgacaaatg gagtgcaggt cctaaaccag gccagaagtt tgttcagtgttcatgttccc 66060 aaacatacat tctggatggc aggttaattt attataacat tcactcttatagctttctta 66120 gtcaaaatta caaataaggt ctcctaaaaa atgcatcgac gttgatatctgtgttttctg 66180 ccatgcagaa tgacttgctc tcaatggttg aagctgcaat cttccaggctcatttctggc 66240 tgaacccaaa ttggtggctt ggaggctgga gctgcatgac atcagagatcacaatggcca 66300 ctttttgttg ccatgggtga gggtacatgg atcatgccgc tgagcctttactccgagcta 66360 gcagaaaact ggctggctgt agatcgagtt cgccatcgcc ctcaactttgtcgatgtgat 66420 ggtgatcatg gatagatgta tgtcacagcg caaagcggag actccaatgttcgcaaccat 66480 actgtaaatg tgggagggga aaatttctca aggggacaac catggaacagaggagatgga 66540 gcaaagtagt tgctcttgct atcatttcag agctcaggct gatctctagctcagatagga 66600 gtactaattc ttatgtgtgt gtatttgatc tatgatggta cgttttaagagttggaccag 66660 atattgcttg atctttacat gatgaattga tgatatgcaa aaaatggtatttgacatgga 66720 gacgcaggaa cagggtggtt cattgatctg tttgatctta atcttggagagctaggatgg 66780 aaaactgagt tggtggttat atctctacta cttttgatgt tgtccaaaactaatttgttt 66840 ctactcatgt acgtatgtaa aaaggaaaat tttaatgatt aaagaacttattgttctctt 66900 taatcagatt atttatcagt gggatttttc tttctctggt ctgtataaaaatactttttt 66960 tatcgtcaat cctcctaaaa ttcctattta taaactattt tttcttcagtttttttcctg 67020 tattttggtt ttctttcctc tacgtgacta cacattttga atcgaacatgctctgtattc 67080 ggccaactta gggccatttg aaatgtgcac tggacagttg tttagaacacttccaaggaa 67140 tttctcgtca gttgtcggcc gacggctatt taaacctgat ttaagaattccatgcgcaga 67200 gacttgtact agcgtcgaag acttgcttcg gagactcgga gtcaagtcaaggccaaggcc 67260 accacagcgt cttcgacgac gaagactcgg gttaatgctt cttcgacgacgacgacgacg 67320 ccagccacgg cgagccaagc caaccaccgc ggacctgagg gccccggccaagcgcaccca 67380 cgaaccaagt accaaccacg ccattgcttt tgtctacaaa agacatccgcacattccttc 67440 ctttccccat tgttgctcag ccactcggct tcggcttttc ctccagctccaccaaccttt 67500 cccgagatta gagagggaag aggagcacgc cgaaggatca cggtcgaacacctactatgc 67560 gcacggccgt gtgtgcacag atccattatc ccactagtgt gtgcaaagcaaagcagcata 67620 ggtaagtaac tccgattttc tgagctttgc tatttaatta agcagggaagattgtttggg 67680 tattttccgc agttttctgg aaaccatttt tttccccaat catttccatgtttagctatt 67740 tttcacgcct gtctgttaat ctaatcctgc aaaacatagc ataaagttgtaatacaagta 67800 catcgccaat ctacaacttt tttttaaaag ttagaacatt caccccaaggaagaaaaggt 67860 aaacttatga attgttggac ttggactaat cttatatttt tcccaaagacttggaggaat 67920 gttaaggaat atcactcgga ctgtccctcc tttttatttg catttcctcaagattttttt 67980 tttggggcat gtttgttgtt tggctaatgg gttagagaaa tgatttcccacttaccaaaa 68040 gacatcttta aatcctaacc attcattgtc cctttttcaa ttaatcctaacccttcattc 68100 attttccaat cccaatttca cctctcattt tccattatcc aaacatagccacggagcttg 68160 tttgtgtata ttctgttaaa ccagcggttt tgagacacta attaagcaaatttactagtt 68220 aagagattat aacgtttgag tttgggcttt ggagatgtca ctggtggtcgaaggtacgca 68280 agccctttct gctttgctac gaagaacctc cgagctcagt caatgtcaggaaaacaaact 68340 ttgataacta caagtggaag ccatttttgg ttggttttat gcatgatctgtttgtccctg 68400 ttattatgtc ctgcaaactt ttcattgccc ggtttgatta atacatgtctatttgatatt 68460 gcagataaat acagttgtag cctgaaagat agaacttaca atctattgcttaaaagttta 68520 aagaaagttg ttcagagaag gaagatcaga tggcggatac agtactcagcattgcaaagt 68580 ccctggtggg aagtgctgta agtaaggttg cttcggttgc tgccgacaagatgatcttgc 68640 tgctgggagt gcagaaggag atatggtgag catctgaatc gcagcctaattaattttatt 68700 ttccagttgc attagatttt ttgggaccac acttatgcag agtggtatggtactcagttt 68760 atttttagtg atttatttat atttttatac catatttgat gaatgcatatgcagtttttt 68820 tttaataagt atattatttg caacatcttc gggagataca tgtagaggaaagtcaaagta 68880 gaatgaaatg gaagttcctc tatataccct gtaggatcga aatagaagaactaagccaac 68940 cagaggggag ggggtgaatg gttagtatac ccaaaaacgt aaactttttgcagaaattaa 69000 agttaccctc agattcgatg tagatcggtc tgaccggagt agtctcgtcaggtctgaccg 69060 cgcctgtgcc gccggtctga ccgcagtagg ttgcccggtc cgaccgctctgtagtcgcct 69120 tagcctcctg atgctgatgc tgctgtgacc gtgtcacgcc cggaatttctatccaaaatt 69180 ccaaacgctt acatgtgtgt gaacccttgt ccaggaatca gccgaggcacacaataacaa 69240 attgataata gagtacaatt attactctaa ttaataagcg tataaaatgtcattacagag 69300 gtagatagtt cctctcaaac aataaagatc taaacagcgg aaaaagataaacggcgcaga 69360 cggctccact ccacaggcag cttgaccaag gctacaccta atcctccacaccatcagctt 69420 cactgtagaa ctcttcctct gatgaatgat tgcaaggtga gtatatgacatactcagcaa 69480 gccacgcagc aaatatgcaa gtgcacagga taacaaagga tggcatagtagggtttcttt 69540 tgcaaaagca gcatttagca aacatttgag aatttaataa aacagttaagtaattaaaca 69600 atattaatcc aacgctatac aacataccct gttgcatagg cccaaccattctgaacaacc 69660 accccggctg cacagttcta tctccaaacc aggaatatac cattccaaaccaggagctaa 69720 tcaaatcatt accaattata gcatcattaa ctatggtgag aagggtgagactaatcacga 69780 aagacattgt tagacccgcc cataaccgcg ggcacggcta ttcgaatagttttactctga 69840 tcagaggtgt accactgtac ccacaagaca caaccccaca tcatgtcaccatgtgcctca 69900 ataccaccac ggtacctcgg aaaggagttg tgacaatacc cctcgcataacacaatccac 69960 tgcagtgcat cttcctggat cataatcacc cccttataaa caaggcatggactccccagc 70020 gacccccgtg ggcttatctc cgccacttct cagtctggtg ctccacaatgaaccatacta 70080 tacaaaagat aaagccgttg cccatgctgg cttgtggttg gcacgataaatgtttcacaa 70140 ccgaaactcg tgaaccggtc cttaattgtc atgagcacga ccatcaaaaccatgtgctca 70200 caacccacca ttatcaggtt ttagttggca cattaattaa ttaactaatcacaattgacc 70260 atcgtgaact atcaataagc catcatgaaa taatagtgaa tcataagttatcccaatagt 70320 gagctaatgt ttctaagcag ggctaagcaa ttatatctaa tatctagttgaaccaataca 70380 taaagctcaa ctagtcaaat tattataacc caaggtatca aggaataaggtaaacaagaa 70440 caaaagggct ataacaaaga ataggttaat tccacccaat gacattcgaaaaataaatgc 70500 aatagttgaa taaaaacaat agctttaaat aggatcaaca tgctcaaagggttgtttggg 70560 atctgtgtga cttgccttgc tggccttgga actcttcaaa ctcttctccggcgaaaacgg 70620 actctccgga aacgacggaa tctaagcaga aaagagcaaa atcaccaaaacagcacataa 70680 acaagcatga acagtacatg tggatatttt taatgtgtag atctcaattttagaaaaatt 70740 tagagacttg aaccaactaa atcggagcta agatgaatta gttatgaatttttaaagatt 70800 aaatcggatt aaaacactta aatcggctta aattgaatta tgacgcaataacgaattatt 70860 tttgaaaagg aaaaggggat ttattgcgtc agcggctagg gtttcggtgggccgggcgca 70920 cgggcggcgg ttcacgcgaa cggacggccg agatctaacc gatccaagatggacggccga 70980 gatcgatcga gtgcacggcg gctcacggcg gaagggcatg atgacgtcagcgatgacgtc 71040 accgacgacg gcggcggctc ggcggctgga acggctcggg cggcgcatgctcgccggcga 71100 acggcggcgc gaggacgcaa acggagggca ccaacggata gcggacggcgcggcgaactc 71160 accggcgacc aaagcgacgg cggagaggca acggacggcg acggcgacgaggaggaaacg 71220 gcgaagctct tcgggttgac gacggcgacg atgctccggt ggtcttcggcggcggcgaag 71280 gggcggacga ggacggcgac gcgacggcga acacgacggc ggccttcccgagcgacggcg 71340 acggctggaa cgacggcggc acacggctgg agcggcggcg gcgacggcaaagctagggca 71400 cacggcgcta gggctcttcc ggcgacgaga ggtgaaggcg aaggtggcggcgagaagaga 71460 agagcacggg gatcctttta aaggggctcg gaggcgacgg cggaggcccacggcgaccgg 71520 cgacgaggaa ggaaagctcg ggaagaacga atccgagacg aactcgaatccaacgctttc 71580 caaagcgatt taaccgatga ttccaaagga gaaaaggtag aggagatcccggagattgtt 71640 tcccctcaac caattcggcc ggagaaggaa agggacggcc ggattggaaggaaacagcgg 71700 cggcggctcg gcggcggcgc gctagggttt cggcccgagg aagacgacgggcctgacagg 71760 tggaccccac ctgtcagcgg cagcgggcgc gcgcgcgcgg cagcggactgggccggactg 71820 ggccgaggag agagagagag ggttttgggc cgactttcgg cccaaagccaaaagaagact 71880 ttttaaaact ttttcaattt aaattatttc ttaaatgcaa ttccatttattaaaaatact 71940 ttcttagctc aaataaatcc cagaaaaatc taggaattat agaattaagtaaagtattta 72000 atgaaatttt atctggcccc attttatatt gaaatttatt tattaaaattagattttctt 72060 ttcaagactt ttaaaaatta tttctaaaaa ttccaattaa acaacaatttatataatttg 72120 aatttttagg gtgtgacaga ccgccgttgc tccgccggtt tgaccgcctccttgccgtcg 72180 gtttgaccgc cggtgagttg ccggttagac cgccgaaccc gataaaacacaaatcgaaga 72240 actcttaaag tagatgacaa ccttattact tatctctgtg tttacaaagtgcaacaacaa 72300 tactccttac aaaaaaactc gactaaactc gaaaccctaa ctaaactatcaactcaattg 72360 ctctcaaaag cgataccgcg aagcctcacg ctccctctct atttatacatgaggtaggca 72420 gcctaaagcc acgaactaaa ctcatactag aagtcctaat ccacctaagaaaccttcttg 72480 tacaagaaac aaactttaca aactccaatc ataccaaatt tggactcctttcaaattcga 72540 ctccgcattc cacacacaca atacctccat cgtatgccat atggaatcttcaccaaccac 72600 gtgcattgaa ctctagccta aatatcccgc atgatatctg accgtcacggacatcgtctt 72660 atctccaagc cgactcaaga tccatcaccg gcgatactct cccgaggcgtcaagtcacct 72720 acacatgaat caaacaaaga aaccgtattc cgagaccaag ctatctccaacttgactcat 72780 tagtagcaaa caacagtata acatacgcat agtatccatc tagaagccataaatatgaaa 72840 caatcacgga tatccaaaca aacaacccaa aaccgaaacc gacacagagtcggccggtca 72900 gaccgcgggc tggccgatct gaccgagtga tacacgtcgg tctgaccagaaccgacaaat 72960 cagcagcacc tgtttaacac ctgcaaatcc aatcatctcc aaaaccacttcgcaaataaa 73020 ttccaaatat caaaaccaat aatctcagat gccaattgtt catcacagaataataataaa 73080 aaacaccttt gattttacat acccaaacag aagtggacat agctttctcatggcaaccat 73140 gttcaatgaa ggaatacaaa cgagccagct atttgtagga tcgaatcacaagaactaagc 73200 ctaccagagg ggggtaaatg gttggtatac ccaaaaaccg aaaacttttagcggaaataa 73260 aagttaccct caatttcgat gagatcggtc tgaccggagt agattagccggtctgaccga 73320 agcgtagccg ccggtctgac cggtgttgat cttccggttg gactgccctggaatccctgc 73380 cgcgcctgat gtcgccaccg gtctgaccgc aggtcacctg ccggtctgaccgccgtgatg 73440 ccgccggtct gaccgccggt gcgccgccgg ttagaccgcc gaaacccggtgaaacacaaa 73500 tcgaagaact cttaaagtag atgacgactt tattgattct ctctgtgtttacaaagtgca 73560 ccaacagcac tccttacaaa aatttcgact aaactcgaaa ccctaactcaaaactcaact 73620 caattgctct caaaagcgat accgggaagc ctcacgctcc ccctctatttatacccaagg 73680 taggcagtct aaagccatga accaaactca tactaagagt cctaaacaccttaggaaacc 73740 ctctagtaca agaaagaaac tttacataac caatcgtatc aaattcggactccttccaaa 73800 ttcgactccg catcccatac acatacaata cctccatcgt atgccatatagaaactccat 73860 caaccacgtg catcaattct agcctaagta tcctgcatga tctttgaccaccacggacgt 73920 cgtcttatcc ccaagccgac tcccggtcca tcaccgcaaa tactctcccgagacatcgag 73980 tcacctacac atgaaataaa caaagaaacc atattccgag accaagctatctccaacttg 74040 actcattagt agcaaacaat agtattacat acgtatagta tccatctagaagtcataatc 74100 atgaaataat cacggatatc caaacaaaca acccgaaacc gaaaacgacacagcgtcggc 74160 tggtcagacc gcgggctggc cggtctgacc gctcgatcac cgccggtctgaccggcacat 74220 actgcctggt ctgaccggtc acataaaatg atagaatcct gcgatcacctgtaaaatcca 74280 atcatctcca aaaacacttc gtgaataaat tccaaataac aaaaccactaatctccaatg 74340 cccaattgtt cgtcacagaa taataatcaa aaacaccttt tattttacactattatggat 74400 ctggtggtaa tctaatttag acagaaacat ggttgccttc gagagtaaacaatgtagcat 74460 gccatgctta agtcaaatgc atatacaatt ggacaaggaa ctattacgtttcagtgctgt 74520 tcaattttgc ttttgacctt ggaaccaaat atataatgca aattaatacgttcctagttt 74580 aagcttccaa tttgttcata ttttgtgatt tataatgcta tatatgcatgtatacagaca 74640 agcaacgcag tccacgtggg atgttggcca cgactggatg agctgagcaaaatttagacc 74700 tgtgtcaaga aaatgtctga gttcctttga acattttgtt tacaggttcatcaaagacga 74760 gctacaaacg atacaagcat ttttgatggc tgccgaagca tcaaagaaaagcatactgtt 74820 gaaggtttgg gtgcagcaag taagggatct atcctatgac attgaagattgccttgatga 74880 atttacagtt catgtgggca gccaaacctt gtcgaggcag ttgatgaagctaaaggatcg 74940 ccatcggatc gccgttcaga tccgcaatct caggacaaga atcgaagaagtgagcagtag 75000 gaacacacgc tacaacttaa tagagaatga ccttaccagc accactggtgagaggaattt 75060 cattatggaa gacattcgaa atgaatcagc taacaatatc gaggaagctgagcttgtggg 75120 tttttcggga cccaaaagag agttgcttga tcttatagat gtccatgccaaggatggacc 75180 tacaaaggtt gtatgtgttg tcggtatggg tggtttgggt aagactactattgcaaggaa 75240 aatttatgaa agcaaagagg acattgcaaa gaatttttct tgctgtgcttggattactgt 75300 ttcacagtcc tttgttaggt tggaactact caaggatttg atggtgaaactttttggaga 75360 ggaagtactc aagaagcggc tgagagaact tgaagggaag gttccacaagtagacgacct 75420 cgccagctac ctcaggacag agttaaatga aaggaggtac tttgttgtgcttgatgatgt 75480 gtggagtaca gattcatgga aatggattaa tagtattgcc ttccctagaaataacaataa 75540 agggagccgg gtgatagtaa caacaagaga tgttggctta gctaaggagtgtacttctga 75600 attgcttatc taccggctta aacccctaga aataacctat gcgaaagagttgcttctaag 75660 gaaagcaaat aaaaaaattg aagatatgaa aagtgataaa aagatgagtgacatcattac 75720 taaaatagta aaaaagtgtg gctatttacc actggctata ctcacaataggaggcgtgct 75780 tgccaccaaa gatgtaagag agtgggaaac cttttatagt cagataccttcagagcttga 75840 gagcaaccca aaccttgaag caatgagaag gatagtgacc ctaagttacaactacttgcc 75900 atctcatctt aagcaatgct ttttgtatct aagcatattt cctgaggattttgaaattaa 75960 taggaaccgt ctggtaaata gatggatggc agaggggttt attaaagctagggctaatat 76020 gactattgaa gatgttggga aatgttactt taaagaactt atcaaccgtagcatgattta 76080 gccatcaaga gcgggtgtac gaggagaatt taagagctgt cgaatccacgacatcatgcg 76140 tgatattgca atttcgattt ctagagagga aaatttcata ctcttacctgagggcactga 76200 ctatgacgta gtgcatggga acactcggca catagcattt cacgggagtaagtattgctc 76260 tgaaacaagc tt 76272 84 2940 DNA Oryza minuta CDS(1)..(2940) 84 atg gcg gag acg gtg gtg agc atg gcg atg tcg gtg ctg ggcagc gcc 48 Met Ala Glu Thr Val Val Ser Met Ala Met Ser Val Leu Gly SerAla 1 5 10 15 gtc ggg aag gcc gcc tcc gcc gcc gcc gac gag gcc acc ctcctg ctc 96 Val Gly Lys Ala Ala Ser Ala Ala Ala Asp Glu Ala Thr Leu LeuLeu 20 25 30 ggc gtc cag aag gag atc tgg tac atc aag gac gag ctg aaa actatt 144 Gly Val Gln Lys Glu Ile Trp Tyr Ile Lys Asp Glu Leu Lys Thr Ile35 40 45 cag gca ttc tta aga gct gct gaa gta aca aag aag aaa gat gac ttg192 Gln Ala Phe Leu Arg Ala Ala Glu Val Thr Lys Lys Lys Asp Asp Leu 5055 60 cta aag gta tgg gca gag caa gta cga gat ctg tca tat aac att gaa240 Leu Lys Val Trp Ala Glu Gln Val Arg Asp Leu Ser Tyr Asn Ile Glu 6570 75 80 gat tgc cta gac gaa ttc aag gtt cat gtt gag agc caa agc ttg gca288 Asp Cys Leu Asp Glu Phe Lys Val His Val Glu Ser Gln Ser Leu Ala 8590 95 aag caa cta atg aag ctt ggt gaa cgc cat cga att gct gta cag att336 Lys Gln Leu Met Lys Leu Gly Glu Arg His Arg Ile Ala Val Gln Ile 100105 110 ctc aac tta aaa tca aga att gaa gaa gtg agc aac agg aat aca cgc384 Leu Asn Leu Lys Ser Arg Ile Glu Glu Val Ser Asn Arg Asn Thr Arg 115120 125 tac agc tta atc aag ctc att tcc tct ata acc aca gag gat gag agg432 Tyr Ser Leu Ile Lys Leu Ile Ser Ser Ile Thr Thr Glu Asp Glu Arg 130135 140 gat tcc tac cta gaa gat gct cgc aat cga tca ggt agc aac act gac480 Asp Ser Tyr Leu Glu Asp Ala Arg Asn Arg Ser Gly Ser Asn Thr Asp 145150 155 160 gag tca gaa ctt gtg ggc ttt gcc aag act aaa gat gag ttg cttaaa 528 Glu Ser Glu Leu Val Gly Phe Ala Lys Thr Lys Asp Glu Leu Leu Lys165 170 175 ctg ata gat gtc aat act aat gac ggt cca gct aaa gtg ata tgtgtg 576 Leu Ile Asp Val Asn Thr Asn Asp Gly Pro Ala Lys Val Ile Cys Val180 185 190 gtt gga atg ggt gga tta ggc aag act acc ctt gca agg aag gcatat 624 Val Gly Met Gly Gly Leu Gly Lys Thr Thr Leu Ala Arg Lys Ala Tyr195 200 205 gaa aac aag gaa cac atg aag aac ttc tcg tgt tgt gct tgg atcact 672 Glu Asn Lys Glu His Met Lys Asn Phe Ser Cys Cys Ala Trp Ile Thr210 215 220 gtg tct cag tca ttt gac agg aaa gaa att ctg aaa caa atg atcagg 720 Val Ser Gln Ser Phe Asp Arg Lys Glu Ile Leu Lys Gln Met Ile Arg225 230 235 240 caa ctt ctg ggt gct gat tca tta gac aaa ctc ttg aaa gaattt agt 768 Gln Leu Leu Gly Ala Asp Ser Leu Asp Lys Leu Leu Lys Glu PheSer 245 250 255 gag aag ttg ctc gtg caa gtc cag cat ctc gct gat cac ttggtt gaa 816 Glu Lys Leu Leu Val Gln Val Gln His Leu Ala Asp His Leu ValGlu 260 265 270 ggg cta aag gag aaa agg tac ttt gtt gtc ctt gat gac ctatgg acc 864 Gly Leu Lys Glu Lys Arg Tyr Phe Val Val Leu Asp Asp Leu TrpThr 275 280 285 ata gat gca tgg aat tgg att cat gat act gct ttt ccg aagatt aac 912 Ile Asp Ala Trp Asn Trp Ile His Asp Thr Ala Phe Pro Lys IleAsn 290 295 300 aac aga ggt agt cgc ata ata ata aca acg cga gat gct ggctta gct 960 Asn Arg Gly Ser Arg Ile Ile Ile Thr Thr Arg Asp Ala Gly LeuAla 305 310 315 320 gga agg tgt acc tct gaa tca ctt att tac cac ctt gaaccg tta cat 1008 Gly Arg Cys Thr Ser Glu Ser Leu Ile Tyr His Leu Glu ProLeu His 325 330 335 ata gat gat gct ata cac ttg cta cta gca aag aca aacata aga ctt 1056 Ile Asp Asp Ala Ile His Leu Leu Leu Ala Lys Thr Asn IleArg Leu 340 345 350 gaa gac atg gaa aat gat gag gac ttg ggc agc ata gttaca aaa ttg 1104 Glu Asp Met Glu Asn Asp Glu Asp Leu Gly Ser Ile Val ThrLys Leu 355 360 365 gta aaa agg tgt ggt tat tta ccg ctg gct ata ctc acaata gga ggc 1152 Val Lys Arg Cys Gly Tyr Leu Pro Leu Ala Ile Leu Thr IleGly Gly 370 375 380 att ctt gct act aag aag ata atg gaa tgg gga aaa ttttac aga gaa 1200 Ile Leu Ala Thr Lys Lys Ile Met Glu Trp Gly Lys Phe TyrArg Glu 385 390 395 400 ctt cct tca gag ctt gag agc aat cca agc cta gaagcc atg agg agg 1248 Leu Pro Ser Glu Leu Glu Ser Asn Pro Ser Leu Glu AlaMet Arg Arg 405 410 415 atg gtg acc cta agc tac aat cac tta cca tct catctt aaa cca tgc 1296 Met Val Thr Leu Ser Tyr Asn His Leu Pro Ser His LeuLys Pro Cys 420 425 430 ttt ctt tac cta agt att ttc cct gaa gat ttt gaaatt caa aga ggg 1344 Phe Leu Tyr Leu Ser Ile Phe Pro Glu Asp Phe Glu IleGln Arg Gly 435 440 445 cgc ctg gta gat aga tgg ata gca gag ggt ttt gtcaga gcc aca gat 1392 Arg Leu Val Asp Arg Trp Ile Ala Glu Gly Phe Val ArgAla Thr Asp 450 455 460 ggg gtg aac att gag gat gtt gga aat agt cac tttaat gag ctt atc 1440 Gly Val Asn Ile Glu Asp Val Gly Asn Ser His Phe AsnGlu Leu Ile 465 470 475 480 aac aga agt ctg att cag ccc tca aaa gtt agtaca gat gga gtt gtt 1488 Asn Arg Ser Leu Ile Gln Pro Ser Lys Val Ser ThrAsp Gly Val Val 485 490 495 aag aga tgt cga atc cat gat atc atg cgt gatatc ata gtt tca att 1536 Lys Arg Cys Arg Ile His Asp Ile Met Arg Asp IleIle Val Ser Ile 500 505 510 tct aga gag gaa aat ttt gtg ctg ttg act agggag aag atc act gtt 1584 Ser Arg Glu Glu Asn Phe Val Leu Leu Thr Arg GluLys Ile Thr Val 515 520 525 gta gcg gag gag agc atc cgc cat cta gca tttcat ggg agc aaa tgc 1632 Val Ala Glu Glu Ser Ile Arg His Leu Ala Phe HisGly Ser Lys Cys 530 535 540 tca aag ata tgc ttg gag tgg aac cat ctg cgctca gta act ttg ttt 1680 Ser Lys Ile Cys Leu Glu Trp Asn His Leu Arg SerVal Thr Leu Phe 545 550 555 560 ggt gac aga cct gcg ggg cga aca cct gcactt tgt tca cca caa ttt 1728 Gly Asp Arg Pro Ala Gly Arg Thr Pro Ala LeuCys Ser Pro Gln Phe 565 570 575 agg atg ctg aga gtg ttg gat ctg gaa gatgca aaa ttc aaa ttc aca 1776 Arg Met Leu Arg Val Leu Asp Leu Glu Asp AlaLys Phe Lys Phe Thr 580 585 590 caa aat gat atc agg aat ata ggg ttg ttgcgc cac atg aaa tat ttg 1824 Gln Asn Asp Ile Arg Asn Ile Gly Leu Leu ArgHis Met Lys Tyr Leu 595 600 605 aat ttt gca aga gcc tca act att tat acactt cca agg tcc ata gga 1872 Asn Phe Ala Arg Ala Ser Thr Ile Tyr Thr LeuPro Arg Ser Ile Gly 610 615 620 aaa ttg cag tgc ttg caa att ttg aac atgagg gag gca aat atc tca 1920 Lys Leu Gln Cys Leu Gln Ile Leu Asn Met ArgGlu Ala Asn Ile Ser 625 630 635 640 gca cta aca act gag gtg act aaa ctccag aat ctc cgt agc ctc cga 1968 Ala Leu Thr Thr Glu Val Thr Lys Leu GlnAsn Leu Arg Ser Leu Arg 645 650 655 tgc agc agg agg tca ggt tct ggt tacttt agc ata ata gat aat ccc 2016 Cys Ser Arg Arg Ser Gly Ser Gly Tyr PheSer Ile Ile Asp Asn Pro 660 665 670 aag gaa tgc ttg atg atc acc atg tgctta ccg atg gtt ttc tca act 2064 Lys Glu Cys Leu Met Ile Thr Met Cys LeuPro Met Val Phe Ser Thr 675 680 685 tca ata aat ttc agt gac cgt gtg aagtta att cct gag ata tgc atg 2112 Ser Ile Asn Phe Ser Asp Arg Val Lys LeuIle Pro Glu Ile Cys Met 690 695 700 tca tgt tct acc cgt tgg tct gat acaaag ggt gtg agg gtg cca aga 2160 Ser Cys Ser Thr Arg Trp Ser Asp Thr LysGly Val Arg Val Pro Arg 705 710 715 720 gga att gac aac cta aaa gag ttacag att cta gaa gtc gtg gac atc 2208 Gly Ile Asp Asn Leu Lys Glu Leu GlnIle Leu Glu Val Val Asp Ile 725 730 735 aac aga act agt agg aag gcg attgaa gag ctg ggg gag cta att cag 2256 Asn Arg Thr Ser Arg Lys Ala Ile GluGlu Leu Gly Glu Leu Ile Gln 740 745 750 tta aga aaa tta agc gtg aca acaaaa ggc gcc aca aat aag aag tat 2304 Leu Arg Lys Leu Ser Val Thr Thr LysGly Ala Thr Asn Lys Lys Tyr 755 760 765 cag ata ttt tgt gca gcg att gagaag ctc tct tct ctg caa tct ctc 2352 Gln Ile Phe Cys Ala Ala Ile Glu LysLeu Ser Ser Leu Gln Ser Leu 770 775 780 cgt gtg gat gct gag gga ttc tcagat act gga aca ctt gag tgg ctc 2400 Arg Val Asp Ala Glu Gly Phe Ser AspThr Gly Thr Leu Glu Trp Leu 785 790 795 800 aat tcg att gca tgt cct cctcca ttc ttg aag aca ctc aag ttg aat 2448 Asn Ser Ile Ala Cys Pro Pro ProPhe Leu Lys Thr Leu Lys Leu Asn 805 810 815 gga tct ctt gca gat aca ccaaac tgg ttt ggg aac ctt aag cag ctg 2496 Gly Ser Leu Ala Asp Thr Pro AsnTrp Phe Gly Asn Leu Lys Gln Leu 820 825 830 gtg aag atg tgc tta tcc agatgt gtg cta caa gat ggt aaa act atg 2544 Val Lys Met Cys Leu Ser Arg CysVal Leu Gln Asp Gly Lys Thr Met 835 840 845 gag ata ctt ggg gca ctg cccaac ctt atg gtt ctt cgt ctt tat cgc 2592 Glu Ile Leu Gly Ala Leu Pro AsnLeu Met Val Leu Arg Leu Tyr Arg 850 855 860 aac gca tat gct gac gag aaaatg gca ttc aga agg gga aca ttc cca 2640 Asn Ala Tyr Ala Asp Glu Lys MetAla Phe Arg Arg Gly Thr Phe Pro 865 870 875 880 aat ctc agg tgt ctt gatatt tac ttg ctg aag caa ctt aga gag ata 2688 Asn Leu Arg Cys Leu Asp IleTyr Leu Leu Lys Gln Leu Arg Glu Ile 885 890 895 aga ttt gag gag ggc acctcg cca aca atg gaa agt ata gaa att tat 2736 Arg Phe Glu Glu Gly Thr SerPro Thr Met Glu Ser Ile Glu Ile Tyr 900 905 910 ggt tgc agg ttg gaa tcaggg att att ggt atc aag cac ctt cca aga 2784 Gly Cys Arg Leu Glu Ser GlyIle Ile Gly Ile Lys His Leu Pro Arg 915 920 925 ctt aag att att tcg cttgaa tat gat ggt aaa gtc gcg aag ctt gat 2832 Leu Lys Ile Ile Ser Leu GluTyr Asp Gly Lys Val Ala Lys Leu Asp 930 935 940 gtg ctg caa gag gaa gtgaat aca cac ccc aat cat act gaa ctg caa 2880 Val Leu Gln Glu Glu Val AsnThr His Pro Asn His Thr Glu Leu Gln 945 950 955 960 atg gca gag gat cgaagt cat cat gac cta gga ggt ata tta tat aag 2928 Met Ala Glu Asp Arg SerHis His Asp Leu Gly Gly Ile Leu Tyr Lys 965 970 975 gat caa ata tag 2940Asp Gln Ile 85 979 PRT Oryza minuta 85 Met Ala Glu Thr Val Val Ser MetAla Met Ser Val Leu Gly Ser Ala 1 5 10 15 Val Gly Lys Ala Ala Ser AlaAla Ala Asp Glu Ala Thr Leu Leu Leu 20 25 30 Gly Val Gln Lys Glu Ile TrpTyr Ile Lys Asp Glu Leu Lys Thr Ile 35 40 45 Gln Ala Phe Leu Arg Ala AlaGlu Val Thr Lys Lys Lys Asp Asp Leu 50 55 60 Leu Lys Val Trp Ala Glu GlnVal Arg Asp Leu Ser Tyr Asn Ile Glu 65 70 75 80 Asp Cys Leu Asp Glu PheLys Val His Val Glu Ser Gln Ser Leu Ala 85 90 95 Lys Gln Leu Met Lys LeuGly Glu Arg His Arg Ile Ala Val Gln Ile 100 105 110 Leu Asn Leu Lys SerArg Ile Glu Glu Val Ser Asn Arg Asn Thr Arg 115 120 125 Tyr Ser Leu IleLys Leu Ile Ser Ser Ile Thr Thr Glu Asp Glu Arg 130 135 140 Asp Ser TyrLeu Glu Asp Ala Arg Asn Arg Ser Gly Ser Asn Thr Asp 145 150 155 160 GluSer Glu Leu Val Gly Phe Ala Lys Thr Lys Asp Glu Leu Leu Lys 165 170 175Leu Ile Asp Val Asn Thr Asn Asp Gly Pro Ala Lys Val Ile Cys Val 180 185190 Val Gly Met Gly Gly Leu Gly Lys Thr Thr Leu Ala Arg Lys Ala Tyr 195200 205 Glu Asn Lys Glu His Met Lys Asn Phe Ser Cys Cys Ala Trp Ile Thr210 215 220 Val Ser Gln Ser Phe Asp Arg Lys Glu Ile Leu Lys Gln Met IleArg 225 230 235 240 Gln Leu Leu Gly Ala Asp Ser Leu Asp Lys Leu Leu LysGlu Phe Ser 245 250 255 Glu Lys Leu Leu Val Gln Val Gln His Leu Ala AspHis Leu Val Glu 260 265 270 Gly Leu Lys Glu Lys Arg Tyr Phe Val Val LeuAsp Asp Leu Trp Thr 275 280 285 Ile Asp Ala Trp Asn Trp Ile His Asp ThrAla Phe Pro Lys Ile Asn 290 295 300 Asn Arg Gly Ser Arg Ile Ile Ile ThrThr Arg Asp Ala Gly Leu Ala 305 310 315 320 Gly Arg Cys Thr Ser Glu SerLeu Ile Tyr His Leu Glu Pro Leu His 325 330 335 Ile Asp Asp Ala Ile HisLeu Leu Leu Ala Lys Thr Asn Ile Arg Leu 340 345 350 Glu Asp Met Glu AsnAsp Glu Asp Leu Gly Ser Ile Val Thr Lys Leu 355 360 365 Val Lys Arg CysGly Tyr Leu Pro Leu Ala Ile Leu Thr Ile Gly Gly 370 375 380 Ile Leu AlaThr Lys Lys Ile Met Glu Trp Gly Lys Phe Tyr Arg Glu 385 390 395 400 LeuPro Ser Glu Leu Glu Ser Asn Pro Ser Leu Glu Ala Met Arg Arg 405 410 415Met Val Thr Leu Ser Tyr Asn His Leu Pro Ser His Leu Lys Pro Cys 420 425430 Phe Leu Tyr Leu Ser Ile Phe Pro Glu Asp Phe Glu Ile Gln Arg Gly 435440 445 Arg Leu Val Asp Arg Trp Ile Ala Glu Gly Phe Val Arg Ala Thr Asp450 455 460 Gly Val Asn Ile Glu Asp Val Gly Asn Ser His Phe Asn Glu LeuIle 465 470 475 480 Asn Arg Ser Leu Ile Gln Pro Ser Lys Val Ser Thr AspGly Val Val 485 490 495 Lys Arg Cys Arg Ile His Asp Ile Met Arg Asp IleIle Val Ser Ile 500 505 510 Ser Arg Glu Glu Asn Phe Val Leu Leu Thr ArgGlu Lys Ile Thr Val 515 520 525 Val Ala Glu Glu Ser Ile Arg His Leu AlaPhe His Gly Ser Lys Cys 530 535 540 Ser Lys Ile Cys Leu Glu Trp Asn HisLeu Arg Ser Val Thr Leu Phe 545 550 555 560 Gly Asp Arg Pro Ala Gly ArgThr Pro Ala Leu Cys Ser Pro Gln Phe 565 570 575 Arg Met Leu Arg Val LeuAsp Leu Glu Asp Ala Lys Phe Lys Phe Thr 580 585 590 Gln Asn Asp Ile ArgAsn Ile Gly Leu Leu Arg His Met Lys Tyr Leu 595 600 605 Asn Phe Ala ArgAla Ser Thr Ile Tyr Thr Leu Pro Arg Ser Ile Gly 610 615 620 Lys Leu GlnCys Leu Gln Ile Leu Asn Met Arg Glu Ala Asn Ile Ser 625 630 635 640 AlaLeu Thr Thr Glu Val Thr Lys Leu Gln Asn Leu Arg Ser Leu Arg 645 650 655Cys Ser Arg Arg Ser Gly Ser Gly Tyr Phe Ser Ile Ile Asp Asn Pro 660 665670 Lys Glu Cys Leu Met Ile Thr Met Cys Leu Pro Met Val Phe Ser Thr 675680 685 Ser Ile Asn Phe Ser Asp Arg Val Lys Leu Ile Pro Glu Ile Cys Met690 695 700 Ser Cys Ser Thr Arg Trp Ser Asp Thr Lys Gly Val Arg Val ProArg 705 710 715 720 Gly Ile Asp Asn Leu Lys Glu Leu Gln Ile Leu Glu ValVal Asp Ile 725 730 735 Asn Arg Thr Ser Arg Lys Ala Ile Glu Glu Leu GlyGlu Leu Ile Gln 740 745 750 Leu Arg Lys Leu Ser Val Thr Thr Lys Gly AlaThr Asn Lys Lys Tyr 755 760 765 Gln Ile Phe Cys Ala Ala Ile Glu Lys LeuSer Ser Leu Gln Ser Leu 770 775 780 Arg Val Asp Ala Glu Gly Phe Ser AspThr Gly Thr Leu Glu Trp Leu 785 790 795 800 Asn Ser Ile Ala Cys Pro ProPro Phe Leu Lys Thr Leu Lys Leu Asn 805 810 815 Gly Ser Leu Ala Asp ThrPro Asn Trp Phe Gly Asn Leu Lys Gln Leu 820 825 830 Val Lys Met Cys LeuSer Arg Cys Val Leu Gln Asp Gly Lys Thr Met 835 840 845 Glu Ile Leu GlyAla Leu Pro Asn Leu Met Val Leu Arg Leu Tyr Arg 850 855 860 Asn Ala TyrAla Asp Glu Lys Met Ala Phe Arg Arg Gly Thr Phe Pro 865 870 875 880 AsnLeu Arg Cys Leu Asp Ile Tyr Leu Leu Lys Gln Leu Arg Glu Ile 885 890 895Arg Phe Glu Glu Gly Thr Ser Pro Thr Met Glu Ser Ile Glu Ile Tyr 900 905910 Gly Cys Arg Leu Glu Ser Gly Ile Ile Gly Ile Lys His Leu Pro Arg 915920 925 Leu Lys Ile Ile Ser Leu Glu Tyr Asp Gly Lys Val Ala Lys Leu Asp930 935 940 Val Leu Gln Glu Glu Val Asn Thr His Pro Asn His Thr Glu LeuGln 945 950 955 960 Met Ala Glu Asp Arg Ser His His Asp Leu Gly Gly IleLeu Tyr Lys 965 970 975 Asp Gln Ile 86 2982 DNA Oryza minuta CDS(1)..(2982) 86 atg act gag acg gtg gta agc atg gcc atg tcg ctg gtc gggagt gcc 48 Met Thr Glu Thr Val Val Ser Met Ala Met Ser Leu Val Gly SerAla 1 5 10 15 atc aga gtg gct tct tct gct tcc agc caa gag atg agc atgctg att 96 Ile Arg Val Ala Ser Ser Ala Ser Ser Gln Glu Met Ser Met LeuIle 20 25 30 ggt gtg caa aac gag att tgg ttc ata aaa gat gag ctg aag acaatg 144 Gly Val Gln Asn Glu Ile Trp Phe Ile Lys Asp Glu Leu Lys Thr Met35 40 45 cag gca ttc ctg agg gcg gct gaa gta agg aaa gat caa gat gaa ctc192 Gln Ala Phe Leu Arg Ala Ala Glu Val Arg Lys Asp Gln Asp Glu Leu 5055 60 gtg aag gtg tgg gct gag caa gtg aga gat cta gcc tac gac att gaa240 Val Lys Val Trp Ala Glu Gln Val Arg Asp Leu Ala Tyr Asp Ile Glu 6570 75 80 gat tgt ctt cag gag ttt gct gtc cat gtt ggg cat caa agc cta tca288 Asp Cys Leu Gln Glu Phe Ala Val His Val Gly His Gln Ser Leu Ser 8590 95 atg cag ctg att aag ctc tgc cac cga cac cga atc gct gtc cag att336 Met Gln Leu Ile Lys Leu Cys His Arg His Arg Ile Ala Val Gln Ile 100105 110 cgg aac ctg aaa cag agg gtt gaa gaa gtg agc aat aga aac atg cgt384 Arg Asn Leu Lys Gln Arg Val Glu Glu Val Ser Asn Arg Asn Met Arg 115120 125 tac aat cta atc aag tct gtt cca tct agc agt acc gat gac tcc caa432 Tyr Asn Leu Ile Lys Ser Val Pro Ser Ser Ser Thr Asp Asp Ser Gln 130135 140 agc aac atg gaa ttg att cgg tat cag act gct cat tat gtt aat gaa480 Ser Asn Met Glu Leu Ile Arg Tyr Gln Thr Ala His Tyr Val Asn Glu 145150 155 160 tca gag ctt gtt ggg ttt gac gtt ccc aaa aag gag atc ctg gagttg 528 Ser Glu Leu Val Gly Phe Asp Val Pro Lys Lys Glu Ile Leu Glu Leu165 170 175 gta tca agc agt gaa aat gct gaa gcc caa aca att tgg att gtcgga 576 Val Ser Ser Ser Glu Asn Ala Glu Ala Gln Thr Ile Trp Ile Val Gly180 185 190 gaa ggt ggt ctt ggg aag act act ctt gca aag aag gtt tat gaaagc 624 Glu Gly Gly Leu Gly Lys Thr Thr Leu Ala Lys Lys Val Tyr Glu Ser195 200 205 tca gat gtt agt agc aag ttc ccg tgt cgt gct tgg atc act gtgtca 672 Ser Asp Val Ser Ser Lys Phe Pro Cys Arg Ala Trp Ile Thr Val Ser210 215 220 cga tca ttt aat gtc aag gat tta ctg aag aac atg atc aag caatta 720 Arg Ser Phe Asn Val Lys Asp Leu Leu Lys Asn Met Ile Lys Gln Leu225 230 235 240 cta ggg gaa gac tca ctc aag aaa ctc ttg gat gaa cac aaagag gtg 768 Leu Gly Glu Asp Ser Leu Lys Lys Leu Leu Asp Glu His Lys GluVal 245 250 255 att gta gaa aag cac aac ctt aca gac cac ctg aac aaa gggctt aga 816 Ile Val Glu Lys His Asn Leu Thr Asp His Leu Asn Lys Gly LeuArg 260 265 270 ggg aga agg tat ttt ctt gtt ctt gat gac ttg tgg acc acacaa gca 864 Gly Arg Arg Tyr Phe Leu Val Leu Asp Asp Leu Trp Thr Thr GlnAla 275 280 285 tgg gat tgc att aaa cct act tcc tgg gga aat aac atg gaaggt agc 912 Trp Asp Cys Ile Lys Pro Thr Ser Trp Gly Asn Asn Met Glu GlySer 290 295 300 cga gtt gta gta aca aca aga aac aaa aac cta gca ggt ggtagt tcc 960 Arg Val Val Val Thr Thr Arg Asn Lys Asn Leu Ala Gly Gly SerSer 305 310 315 320 acc tcg cca gtc tac tgc ctt aaa act cta gag aaa gaagat gct acc 1008 Thr Ser Pro Val Tyr Cys Leu Lys Thr Leu Glu Lys Glu AspAla Thr 325 330 335 aaa ttg ttg tta aga aag aca aaa aga agt ctg cat gacata gaa aaa 1056 Lys Leu Leu Leu Arg Lys Thr Lys Arg Ser Leu His Asp IleGlu Lys 340 345 350 gat caa atg aaa gaa ata ttt gaa aag ata cta aag aaatgt gga ggt 1104 Asp Gln Met Lys Glu Ile Phe Glu Lys Ile Leu Lys Lys CysGly Gly 355 360 365 cta ccg cta gct ata atc aca ata ggt gca gtc ctt gaaggg aaa gat 1152 Leu Pro Leu Ala Ile Ile Thr Ile Gly Ala Val Leu Glu GlyLys Asp 370 375 380 ata aaa gag tgg gaa att ttg tat gct caa ctt cca tcagaa ctt gaa 1200 Ile Lys Glu Trp Glu Ile Leu Tyr Ala Gln Leu Pro Ser GluLeu Glu 385 390 395 400 agc aat cca atc gct gaa cca atg aag aag gtg gtaacc ctt agt tac 1248 Ser Asn Pro Ile Ala Glu Pro Met Lys Lys Val Val ThrLeu Ser Tyr 405 410 415 aat tac ttg cca tct cat ctt aag cct tgc ttt ctgtac ctt tgc atc 1296 Asn Tyr Leu Pro Ser His Leu Lys Pro Cys Phe Leu TyrLeu Cys Ile 420 425 430 ttt cct gag gat ttt gat atc caa agg aag cgc ctagta cat aga tgg 1344 Phe Pro Glu Asp Phe Asp Ile Gln Arg Lys Arg Leu ValHis Arg Trp 435 440 445 att gca gag ggg ttt att aga gct agg ggt gga gtggga att gtg gat 1392 Ile Ala Glu Gly Phe Ile Arg Ala Arg Gly Gly Val GlyIle Val Asp 450 455 460 gtg gca caa aag tat ttt gat gag ttg atc aac cgaagt atg att caa 1440 Val Ala Gln Lys Tyr Phe Asp Glu Leu Ile Asn Arg SerMet Ile Gln 465 470 475 480 gca tct aga gtg gat ata gag gga aat att aagagc tgc cga gtc cat 1488 Ala Ser Arg Val Asp Ile Glu Gly Asn Ile Lys SerCys Arg Val His 485 490 495 gat atc atg cgt gat gtc atg ata tca ata tctagg gaa gaa aat ttt 1536 Asp Ile Met Arg Asp Val Met Ile Ser Ile Ser ArgGlu Glu Asn Phe 500 505 510 gta tac ttg atg ggt gat gat gga act agt gtagtg gag gaa aat att 1584 Val Tyr Leu Met Gly Asp Asp Gly Thr Ser Val ValGlu Glu Asn Ile 515 520 525 cgc cat tta gtg cac cat gat act agc aag tgttca aat ata ggc atg 1632 Arg His Leu Val His His Asp Thr Ser Lys Cys SerAsn Ile Gly Met 530 535 540 gat tgg agc cat gta cgg tca tta act ttg tttggc aat gag aga ccc 1680 Asp Trp Ser His Val Arg Ser Leu Thr Leu Phe GlyAsn Glu Arg Pro 545 550 555 560 aaa ggg cta tct cct tca ttt tgt ttt ccccaa ttg aag atg cta aga 1728 Lys Gly Leu Ser Pro Ser Phe Cys Phe Pro GlnLeu Lys Met Leu Arg 565 570 575 gtg ctg gat ctt caa gat gtc aaa ttt gggatg aca caa aaa gat atc 1776 Val Leu Asp Leu Gln Asp Val Lys Phe Gly MetThr Gln Lys Asp Ile 580 585 590 gga aaa ata ggg ttg ctg cgt cac ttg aaatat gtg aat att gga ggg 1824 Gly Lys Ile Gly Leu Leu Arg His Leu Lys TyrVal Asn Ile Gly Gly 595 600 605 cat tca agt att tat gca ctt cct agg tgtata gga aaa tta aaa gac 1872 His Ser Ser Ile Tyr Ala Leu Pro Arg Cys IleGly Lys Leu Lys Asp 610 615 620 ttg tgc act ttg gac ata act gac agt tacatt aca gaa cta cca act 1920 Leu Cys Thr Leu Asp Ile Thr Asp Ser Tyr IleThr Glu Leu Pro Thr 625 630 635 640 gag att agt aaa ttg cag agt cta tgtatt ctc cgt tgt aga gga aga 1968 Glu Ile Ser Lys Leu Gln Ser Leu Cys IleLeu Arg Cys Arg Gly Arg 645 650 655 cca aac tcg ggg gat ttt aat cta aatgat cct aag gat tgt cta att 2016 Pro Asn Ser Gly Asp Phe Asn Leu Asn AspPro Lys Asp Cys Leu Ile 660 665 670 gcc ttc tca tgt ttg cct ctg ctt atggct gca acc gat tct gat gaa 2064 Ala Phe Ser Cys Leu Pro Leu Leu Met AlaAla Thr Asp Ser Asp Glu 675 680 685 cgt aat aaa ata att gct gag cta cacgtg ggt tgt tca agt caa tgg 2112 Arg Asn Lys Ile Ile Ala Glu Leu His ValGly Cys Ser Ser Gln Trp 690 695 700 tct cct aat ggt ggt aca tat ggt gtgagg gta cct aga gga atc aag 2160 Ser Pro Asn Gly Gly Thr Tyr Gly Val ArgVal Pro Arg Gly Ile Lys 705 710 715 720 aat ttg aaa agg ctg cag gtg ctagag aca gtg gat atc aat cga acc 2208 Asn Leu Lys Arg Leu Gln Val Leu GluThr Val Asp Ile Asn Arg Thr 725 730 735 agc agt aag tca gtt gaa gag ttggga gag ctt atc cag cta aga aaa 2256 Ser Ser Lys Ser Val Glu Glu Leu GlyGlu Leu Ile Gln Leu Arg Lys 740 745 750 cta agt gta gta aca caa ggg tccacc aag gaa aaa tgc aaa ata ctc 2304 Leu Ser Val Val Thr Gln Gly Ser ThrLys Glu Lys Cys Lys Ile Leu 755 760 765 tgt aca gcc att cag aag cta acttcc ctc aaa act ctc tat ctg aat 2352 Cys Thr Ala Ile Gln Lys Leu Thr SerLeu Lys Thr Leu Tyr Leu Asn 770 775 780 gct cat gga ccc ttg gat act ggaaca ctt gaa tgg cta cat tct att 2400 Ala His Gly Pro Leu Asp Thr Gly ThrLeu Glu Trp Leu His Ser Ile 785 790 795 800 tcc cat ctt cct tcc ctg aggatc atc aga ttg atc gga tac atg aaa 2448 Ser His Leu Pro Ser Leu Arg IleIle Arg Leu Ile Gly Tyr Met Lys 805 810 815 gag atg ccc aac tgg ttt agggag ctc aga cag ctg gtg aag att cac 2496 Glu Met Pro Asn Trp Phe Arg GluLeu Arg Gln Leu Val Lys Ile His 820 825 830 ttg cag aat agc caa cta gaggag gat aaa aca atg gag ata ctc gga 2544 Leu Gln Asn Ser Gln Leu Glu GluAsp Lys Thr Met Glu Ile Leu Gly 835 840 845 gaa ctg cct aat ctc atg ctcctt ttc ctt agt tgg cga gcg gtg cta 2592 Glu Leu Pro Asn Leu Met Leu LeuPhe Leu Ser Trp Arg Ala Val Leu 850 855 860 atc att cgc aat cag aag caactg aaa gag gtg aga ttt gaa gag gga 2640 Ile Ile Arg Asn Gln Lys Gln LeuLys Glu Val Arg Phe Glu Glu Gly 865 870 875 880 acc tca ccc tgg atg gaaagg att gat atc aga gaa tgc aga ttg aca 2688 Thr Ser Pro Trp Met Glu ArgIle Asp Ile Arg Glu Cys Arg Leu Thr 885 890 895 tca ggg att gct ggt atcaaa cac ctt ccg agg ctt aag gag att tca 2736 Ser Gly Ile Ala Gly Ile LysHis Leu Pro Arg Leu Lys Glu Ile Ser 900 905 910 ctt gaa tac agt gct aaagtg gtg agg cta ggt cag cta gag gtt gaa 2784 Leu Glu Tyr Ser Ala Lys ValVal Arg Leu Gly Gln Leu Glu Val Glu 915 920 925 atg ggc aca cac ccc aatcga ccc atg ttg cgc ctg ttt ggg gag cga 2832 Met Gly Thr His Pro Asn ArgPro Met Leu Arg Leu Phe Gly Glu Arg 930 935 940 agc cgt cat gac ctg gggaac acc caa gta act gtc gat gaa caa caa 2880 Ser Arg His Asp Leu Gly AsnThr Gln Val Thr Val Asp Glu Gln Gln 945 950 955 960 ctg cag gtc agt ccattt att tat ggt gtg cat gca act act tgt ttt 2928 Leu Gln Val Ser Pro PheIle Tyr Gly Val His Ala Thr Thr Cys Phe 965 970 975 tct ttc tct tct ttctct tgc tta cta ttt tcc tcc ttc ctt gtt tgt 2976 Ser Phe Ser Ser Phe SerCys Leu Leu Phe Ser Ser Phe Leu Val Cys 980 985 990 ctc tag 2982 Leu 87993 PRT Oryza minuta 87 Met Thr Glu Thr Val Val Ser Met Ala Met Ser LeuVal Gly Ser Ala 1 5 10 15 Ile Arg Val Ala Ser Ser Ala Ser Ser Gln GluMet Ser Met Leu Ile 20 25 30 Gly Val Gln Asn Glu Ile Trp Phe Ile Lys AspGlu Leu Lys Thr Met 35 40 45 Gln Ala Phe Leu Arg Ala Ala Glu Val Arg LysAsp Gln Asp Glu Leu 50 55 60 Val Lys Val Trp Ala Glu Gln Val Arg Asp LeuAla Tyr Asp Ile Glu 65 70 75 80 Asp Cys Leu Gln Glu Phe Ala Val His ValGly His Gln Ser Leu Ser 85 90 95 Met Gln Leu Ile Lys Leu Cys His Arg HisArg Ile Ala Val Gln Ile 100 105 110 Arg Asn Leu Lys Gln Arg Val Glu GluVal Ser Asn Arg Asn Met Arg 115 120 125 Tyr Asn Leu Ile Lys Ser Val ProSer Ser Ser Thr Asp Asp Ser Gln 130 135 140 Ser Asn Met Glu Leu Ile ArgTyr Gln Thr Ala His Tyr Val Asn Glu 145 150 155 160 Ser Glu Leu Val GlyPhe Asp Val Pro Lys Lys Glu Ile Leu Glu Leu 165 170 175 Val Ser Ser SerGlu Asn Ala Glu Ala Gln Thr Ile Trp Ile Val Gly 180 185 190 Glu Gly GlyLeu Gly Lys Thr Thr Leu Ala Lys Lys Val Tyr Glu Ser 195 200 205 Ser AspVal Ser Ser Lys Phe Pro Cys Arg Ala Trp Ile Thr Val Ser 210 215 220 ArgSer Phe Asn Val Lys Asp Leu Leu Lys Asn Met Ile Lys Gln Leu 225 230 235240 Leu Gly Glu Asp Ser Leu Lys Lys Leu Leu Asp Glu His Lys Glu Val 245250 255 Ile Val Glu Lys His Asn Leu Thr Asp His Leu Asn Lys Gly Leu Arg260 265 270 Gly Arg Arg Tyr Phe Leu Val Leu Asp Asp Leu Trp Thr Thr GlnAla 275 280 285 Trp Asp Cys Ile Lys Pro Thr Ser Trp Gly Asn Asn Met GluGly Ser 290 295 300 Arg Val Val Val Thr Thr Arg Asn Lys Asn Leu Ala GlyGly Ser Ser 305 310 315 320 Thr Ser Pro Val Tyr Cys Leu Lys Thr Leu GluLys Glu Asp Ala Thr 325 330 335 Lys Leu Leu Leu Arg Lys Thr Lys Arg SerLeu His Asp Ile Glu Lys 340 345 350 Asp Gln Met Lys Glu Ile Phe Glu LysIle Leu Lys Lys Cys Gly Gly 355 360 365 Leu Pro Leu Ala Ile Ile Thr IleGly Ala Val Leu Glu Gly Lys Asp 370 375 380 Ile Lys Glu Trp Glu Ile LeuTyr Ala Gln Leu Pro Ser Glu Leu Glu 385 390 395 400 Ser Asn Pro Ile AlaGlu Pro Met Lys Lys Val Val Thr Leu Ser Tyr 405 410 415 Asn Tyr Leu ProSer His Leu Lys Pro Cys Phe Leu Tyr Leu Cys Ile 420 425 430 Phe Pro GluAsp Phe Asp Ile Gln Arg Lys Arg Leu Val His Arg Trp 435 440 445 Ile AlaGlu Gly Phe Ile Arg Ala Arg Gly Gly Val Gly Ile Val Asp 450 455 460 ValAla Gln Lys Tyr Phe Asp Glu Leu Ile Asn Arg Ser Met Ile Gln 465 470 475480 Ala Ser Arg Val Asp Ile Glu Gly Asn Ile Lys Ser Cys Arg Val His 485490 495 Asp Ile Met Arg Asp Val Met Ile Ser Ile Ser Arg Glu Glu Asn Phe500 505 510 Val Tyr Leu Met Gly Asp Asp Gly Thr Ser Val Val Glu Glu AsnIle 515 520 525 Arg His Leu Val His His Asp Thr Ser Lys Cys Ser Asn IleGly Met 530 535 540 Asp Trp Ser His Val Arg Ser Leu Thr Leu Phe Gly AsnGlu Arg Pro 545 550 555 560 Lys Gly Leu Ser Pro Ser Phe Cys Phe Pro GlnLeu Lys Met Leu Arg 565 570 575 Val Leu Asp Leu Gln Asp Val Lys Phe GlyMet Thr Gln Lys Asp Ile 580 585 590 Gly Lys Ile Gly Leu Leu Arg His LeuLys Tyr Val Asn Ile Gly Gly 595 600 605 His Ser Ser Ile Tyr Ala Leu ProArg Cys Ile Gly Lys Leu Lys Asp 610 615 620 Leu Cys Thr Leu Asp Ile ThrAsp Ser Tyr Ile Thr Glu Leu Pro Thr 625 630 635 640 Glu Ile Ser Lys LeuGln Ser Leu Cys Ile Leu Arg Cys Arg Gly Arg 645 650 655 Pro Asn Ser GlyAsp Phe Asn Leu Asn Asp Pro Lys Asp Cys Leu Ile 660 665 670 Ala Phe SerCys Leu Pro Leu Leu Met Ala Ala Thr Asp Ser Asp Glu 675 680 685 Arg AsnLys Ile Ile Ala Glu Leu His Val Gly Cys Ser Ser Gln Trp 690 695 700 SerPro Asn Gly Gly Thr Tyr Gly Val Arg Val Pro Arg Gly Ile Lys 705 710 715720 Asn Leu Lys Arg Leu Gln Val Leu Glu Thr Val Asp Ile Asn Arg Thr 725730 735 Ser Ser Lys Ser Val Glu Glu Leu Gly Glu Leu Ile Gln Leu Arg Lys740 745 750 Leu Ser Val Val Thr Gln Gly Ser Thr Lys Glu Lys Cys Lys IleLeu 755 760 765 Cys Thr Ala Ile Gln Lys Leu Thr Ser Leu Lys Thr Leu TyrLeu Asn 770 775 780 Ala His Gly Pro Leu Asp Thr Gly Thr Leu Glu Trp LeuHis Ser Ile 785 790 795 800 Ser His Leu Pro Ser Leu Arg Ile Ile Arg LeuIle Gly Tyr Met Lys 805 810 815 Glu Met Pro Asn Trp Phe Arg Glu Leu ArgGln Leu Val Lys Ile His 820 825 830 Leu Gln Asn Ser Gln Leu Glu Glu AspLys Thr Met Glu Ile Leu Gly 835 840 845 Glu Leu Pro Asn Leu Met Leu LeuPhe Leu Ser Trp Arg Ala Val Leu 850 855 860 Ile Ile Arg Asn Gln Lys GlnLeu Lys Glu Val Arg Phe Glu Glu Gly 865 870 875 880 Thr Ser Pro Trp MetGlu Arg Ile Asp Ile Arg Glu Cys Arg Leu Thr 885 890 895 Ser Gly Ile AlaGly Ile Lys His Leu Pro Arg Leu Lys Glu Ile Ser 900 905 910 Leu Glu TyrSer Ala Lys Val Val Arg Leu Gly Gln Leu Glu Val Glu 915 920 925 Met GlyThr His Pro Asn Arg Pro Met Leu Arg Leu Phe Gly Glu Arg 930 935 940 SerArg His Asp Leu Gly Asn Thr Gln Val Thr Val Asp Glu Gln Gln 945 950 955960 Leu Gln Val Ser Pro Phe Ile Tyr Gly Val His Ala Thr Thr Cys Phe 965970 975 Ser Phe Ser Ser Phe Ser Cys Leu Leu Phe Ser Ser Phe Leu Val Cys980 985 990 Leu 88 3096 DNA Oryza minuta CDS (1)..(3096) 88 atg gcg gagacg gtg ctg agc atg gcg agg tcg ctg gtg ggc agt gcc 48 Met Ala Glu ThrVal Leu Ser Met Ala Arg Ser Leu Val Gly Ser Ala 1 5 10 15 atc agc aaggcc gcc tct gcc gct gcc aat gag acg agc ctc ctg ctc 96 Ile Ser Lys AlaAla Ser Ala Ala Ala Asn Glu Thr Ser Leu Leu Leu 20 25 30 ggc gtc gag aaggac atc tgg tat atc aaa gat gag cta aaa aca atg 144 Gly Val Glu Lys AspIle Trp Tyr Ile Lys Asp Glu Leu Lys Thr Met 35 40 45 cag gca ttc ctt agagct gct gaa gtt atg aaa aag aaa gat gaa cta 192 Gln Ala Phe Leu Arg AlaAla Glu Val Met Lys Lys Lys Asp Glu Leu 50 55 60 tta aag gtt tgg gca gagcaa ata cgt gac ctg tcg tat gac att gaa 240 Leu Lys Val Trp Ala Glu GlnIle Arg Asp Leu Ser Tyr Asp Ile Glu 65 70 75 80 gat tcc ctt gat gaa tttaaa gtc cat att gaa agc caa acc cta ttt 288 Asp Ser Leu Asp Glu Phe LysVal His Ile Glu Ser Gln Thr Leu Phe 85 90 95 cgt cag ttg gtg aaa ctt agagag cgc cac cgg atc gct atc cgt atc 336 Arg Gln Leu Val Lys Leu Arg GluArg His Arg Ile Ala Ile Arg Ile 100 105 110 cac aac ctc aaa tca aga gttgaa gaa gtg agt agc agg aac aca cgc 384 His Asn Leu Lys Ser Arg Val GluGlu Val Ser Ser Arg Asn Thr Arg 115 120 125 tac aat tta gtc gag cct atttcc tcc ggc aca gag gat gac atg gat 432 Tyr Asn Leu Val Glu Pro Ile SerSer Gly Thr Glu Asp Asp Met Asp 130 135 140 tcc tat gca gaa gac att cgcaat caa tca gct cga aat gtg gat gaa 480 Ser Tyr Ala Glu Asp Ile Arg AsnGln Ser Ala Arg Asn Val Asp Glu 145 150 155 160 gct gag ctt gtt ggg ttttct gac tcc aag aaa agg ctg ctt gaa atg 528 Ala Glu Leu Val Gly Phe SerAsp Ser Lys Lys Arg Leu Leu Glu Met 165 170 175 atc gat acc aat gct aatgat ggt ccg gcc aag gta atc tgt gtt gtt 576 Ile Asp Thr Asn Ala Asn AspGly Pro Ala Lys Val Ile Cys Val Val 180 185 190 ggg atg ggt ggt tta ggcaag aca gct ctt tcg agg aag atc ttt gaa 624 Gly Met Gly Gly Leu Gly LysThr Ala Leu Ser Arg Lys Ile Phe Glu 195 200 205 agc gaa gaa gac att aggaag aac ttc cct tgc att gct tgg att aca 672 Ser Glu Glu Asp Ile Arg LysAsn Phe Pro Cys Ile Ala Trp Ile Thr 210 215 220 gtg tca caa tca ttt cacagg att gag cta ctt aaa gat atg ata cgc 720 Val Ser Gln Ser Phe His ArgIle Glu Leu Leu Lys Asp Met Ile Arg 225 230 235 240 caa ctt ctt ggc cccagt tct ctg gat caa ctc ttg caa gaa ttg caa 768 Gln Leu Leu Gly Pro SerSer Leu Asp Gln Leu Leu Gln Glu Leu Gln 245 250 255 ggg aag gtg gtg gtgcaa gta cat cat ctt tct gag tac ctg ata gaa 816 Gly Lys Val Val Val GlnVal His His Leu Ser Glu Tyr Leu Ile Glu 260 265 270 gag ctc aag gag aagagg tac ttt gtt att cta gat gat cta tgg att 864 Glu Leu Lys Glu Lys ArgTyr Phe Val Ile Leu Asp Asp Leu Trp Ile 275 280 285 tta cat gat tgg aattgg ata aat gaa att gca ttt cct aag aac aat 912 Leu His Asp Trp Asn TrpIle Asn Glu Ile Ala Phe Pro Lys Asn Asn 290 295 300 aag aag ggc agt cgaata gta ata acc act cgg aat gtt gat cta gcg 960 Lys Lys Gly Ser Arg IleVal Ile Thr Thr Arg Asn Val Asp Leu Ala 305 310 315 320 gag aag tgt gccaca gcc tca ctg gtg tac cac ctt gat ttc ttg cag 1008 Glu Lys Cys Ala ThrAla Ser Leu Val Tyr His Leu Asp Phe Leu Gln 325 330 335 atg aac gat gccata aca ttg cta ctg aga aaa aca aat aaa aat cat 1056 Met Asn Asp Ala IleThr Leu Leu Leu Arg Lys Thr Asn Lys Asn His 340 345 350 gaa gac atg gaatca aat aaa aat atg caa aag atg gtt gaa cga att 1104 Glu Asp Met Glu SerAsn Lys Asn Met Gln Lys Met Val Glu Arg Ile 355 360 365 gta aat aaa tgtggt cgt cta cca tta gca ata ctt aca ata gga gct 1152 Val Asn Lys Cys GlyArg Leu Pro Leu Ala Ile Leu Thr Ile Gly Ala 370 375 380 gtg ctt gca actaaa cat gtg tca gaa tgg gag aaa ttc tat gaa caa 1200 Val Leu Ala Thr LysHis Val Ser Glu Trp Glu Lys Phe Tyr Glu Gln 385 390 395 400 ctt cct tcagaa cta gaa ata aac cca agc ctg gaa gct ttg agg aga 1248 Leu Pro Ser GluLeu Glu Ile Asn Pro Ser Leu Glu Ala Leu Arg Arg 405 410 415 atg gtg acccta ggt tac aac cac cta cca tcc cat ctg aaa cca tgc 1296 Met Val Thr LeuGly Tyr Asn His Leu Pro Ser His Leu Lys Pro Cys 420 425 430 ttt ttg tatcta agt atc ttt cct gag gat ttt gaa atc aaa agg aat 1344 Phe Leu Tyr LeuSer Ile Phe Pro Glu Asp Phe Glu Ile Lys Arg Asn 435 440 445 cgt cta gtaggt aga tgg ata gca gaa ggg ttt gtt aga cca aag gtt 1392 Arg Leu Val GlyArg Trp Ile Ala Glu Gly Phe Val Arg Pro Lys Val 450 455 460 ggg atg acgact aag gat gtc gga gaa agt tac ttt aat gag cta atc 1440 Gly Met Thr ThrLys Asp Val Gly Glu Ser Tyr Phe Asn Glu Leu Ile 465 470 475 480 aac cgaagt atg att caa cga tca aga gtg ggc ata gca gga aaa att 1488 Asn Arg SerMet Ile Gln Arg Ser Arg Val Gly Ile Ala Gly Lys Ile 485 490 495 aag acttgt cga atc cat gat atc atc cgt gat atc aca gtt tca atc 1536 Lys Thr CysArg Ile His Asp Ile Ile Arg Asp Ile Thr Val Ser Ile 500 505 510 tcg agacag gaa aat ttt gta tta tta cca atg gga gat ggc tct gat 1584 Ser Arg GlnGlu Asn Phe Val Leu Leu Pro Met Gly Asp Gly Ser Asp 515 520 525 tta gttcag gaa aac act cgc cac ata gca ttc cat ggg agt atg tcc 1632 Leu Val GlnGlu Asn Thr Arg His Ile Ala Phe His Gly Ser Met Ser 530 535 540 tgc aaaaca gga ttg gat tgg agc att att cga tca tta gct att ttt 1680 Cys Lys ThrGly Leu Asp Trp Ser Ile Ile Arg Ser Leu Ala Ile Phe 545 550 555 560 ggtgac aga ccc aag agt cta gca cat gca gtt tgt cta gat caa ttg 1728 Gly AspArg Pro Lys Ser Leu Ala His Ala Val Cys Leu Asp Gln Leu 565 570 575 aggatg tta cgg gtc ttg gat ctt gaa gat gtg aca ttc tta atc act 1776 Arg MetLeu Arg Val Leu Asp Leu Glu Asp Val Thr Phe Leu Ile Thr 580 585 590 caaaaa gat ttc gac cgt att gca ttg ttg tgc cac ttg aaa tac ttg 1824 Gln LysAsp Phe Asp Arg Ile Ala Leu Leu Cys His Leu Lys Tyr Leu 595 600 605 agtatt gga tat tcg tca tcc ata tat tca ctt ccc aga tcc att ggt 1872 Ser IleGly Tyr Ser Ser Ser Ile Tyr Ser Leu Pro Arg Ser Ile Gly 610 615 620 aaacta cag ggc cta caa act ttg aac atg ctg aga aca tac att gca 1920 Lys LeuGln Gly Leu Gln Thr Leu Asn Met Leu Arg Thr Tyr Ile Ala 625 630 635 640gca cta cca agt gag atc agt aaa ctc caa tgt ctg cat act ctt cgt 1968 AlaLeu Pro Ser Glu Ile Ser Lys Leu Gln Cys Leu His Thr Leu Arg 645 650 655tgt agt aga aag ttt gtt tat gac aac ttt agt cta aac cac cca atg 2016 CysSer Arg Lys Phe Val Tyr Asp Asn Phe Ser Leu Asn His Pro Met 660 665 670aag tgc ata act aac aca ata tgc ctg cct aaa gta ttc aca cct tta 2064 LysCys Ile Thr Asn Thr Ile Cys Leu Pro Lys Val Phe Thr Pro Leu 675 680 685gtt agt cgc gat gat cgt gca aaa caa att gct gaa ttg cac atg gcc 2112 ValSer Arg Asp Asp Arg Ala Lys Gln Ile Ala Glu Leu His Met Ala 690 695 700acc aaa agt tgc tgg tct gaa tca ttc ggt gtg aag gta ccc aaa gga 2160 ThrLys Ser Cys Trp Ser Glu Ser Phe Gly Val Lys Val Pro Lys Gly 705 710 715720 ata ggt aag ttg cga gac ttg cag gtt cta gag tat gta gat atc agg 2208Ile Gly Lys Leu Arg Asp Leu Gln Val Leu Glu Tyr Val Asp Ile Arg 725 730735 cgg acc agt agt aga gca atc aaa gag ctg ggg cac tta agc aag ttg 2256Arg Thr Ser Ser Arg Ala Ile Lys Glu Leu Gly His Leu Ser Lys Leu 740 745750 agg aaa tta ggt gtg ata aca aaa ggc tcg aca aag gaa aaa tgt aag 2304Arg Lys Leu Gly Val Ile Thr Lys Gly Ser Thr Lys Glu Lys Cys Lys 755 760765 ata ctt tat gca gcc att gag aag ctc tct tcc ctc caa tct ctc tat 2352Ile Leu Tyr Ala Ala Ile Glu Lys Leu Ser Ser Leu Gln Ser Leu Tyr 770 775780 gtg aat gct gcg tta tta tca gat att gaa aca ctt gag tgc cta gat 2400Val Asn Ala Ala Leu Leu Ser Asp Ile Glu Thr Leu Glu Cys Leu Asp 785 790795 800 tct att tca tct cct cct ccc cta ctg agg aca ctc ggg ttg aat gga2448 Ser Ile Ser Ser Pro Pro Pro Leu Leu Arg Thr Leu Gly Leu Asn Gly 805810 815 agt ctt gaa gag atg cct aac tgg att gag cag ctc act cac ctg aag2496 Ser Leu Glu Glu Met Pro Asn Trp Ile Glu Gln Leu Thr His Leu Lys 820825 830 aag atc tac tta ttg agg agc aaa cta aag gaa ggt aaa acc atg ctg2544 Lys Ile Tyr Leu Leu Arg Ser Lys Leu Lys Glu Gly Lys Thr Met Leu 835840 845 ata ctt ggg gca ttg ccc aac ctc atg gtc ctt tat ctt tat tgg aat2592 Ile Leu Gly Ala Leu Pro Asn Leu Met Val Leu Tyr Leu Tyr Trp Asn 850855 860 gct tac ctt ggg gag aag cta gta ttc aaa acg gga gca ttc cca aat2640 Ala Tyr Leu Gly Glu Lys Leu Val Phe Lys Thr Gly Ala Phe Pro Asn 865870 875 880 ctt aga aca ctt cgt att tac gaa ttg gat cag cta aga gag atgaga 2688 Leu Arg Thr Leu Arg Ile Tyr Glu Leu Asp Gln Leu Arg Glu Met Arg885 890 895 ttt gag gat ggc agc tca ccc ctg ttg gaa aag ata gaa atc tcttgc 2736 Phe Glu Asp Gly Ser Ser Pro Leu Leu Glu Lys Ile Glu Ile Ser Cys900 905 910 tgc agg ttg gaa tca ggg att att ggt atc att cac ctt cca aggctc 2784 Cys Arg Leu Glu Ser Gly Ile Ile Gly Ile Ile His Leu Pro Arg Leu915 920 925 aag gag att tca ctt gaa tac aaa agt aaa gtg gct agg ctt ggtcag 2832 Lys Glu Ile Ser Leu Glu Tyr Lys Ser Lys Val Ala Arg Leu Gly Gln930 935 940 ctg gag gga gaa gtg aac aca cac cca aat cgc ccc gtg ctg cgaatg 2880 Leu Glu Gly Glu Val Asn Thr His Pro Asn Arg Pro Val Leu Arg Met945 950 955 960 gac agt gac cga agg gat cac gac ctg ggg gct gaa gcc gaagga tct 2928 Asp Ser Asp Arg Arg Asp His Asp Leu Gly Ala Glu Ala Glu GlySer 965 970 975 tct ata gaa gtg caa aca gca gat cct gtt cct gat gcc gaagga tca 2976 Ser Ile Glu Val Gln Thr Ala Asp Pro Val Pro Asp Ala Glu GlySer 980 985 990 gtc act gta gca gtg gaa gca acg gat ccc ctt ccc gag caggag gga 3024 Val Thr Val Ala Val Glu Ala Thr Asp Pro Leu Pro Glu Gln GluGly 995 1000 1005 gag agc tcg cag tcg cag gtg atc acg ttg acg acg aacgat agg 3069 Glu Ser Ser Gln Ser Gln Val Ile Thr Leu Thr Thr Asn Asp Arg1010 1015 1020 tca gtc act ccc tac atg gca gct taa 3096 Ser Val Thr ProTyr Met Ala Ala 1025 1030 89 1031 PRT Oryza minuta 89 Met Ala Glu ThrVal Leu Ser Met Ala Arg Ser Leu Val Gly Ser Ala 1 5 10 15 Ile Ser LysAla Ala Ser Ala Ala Ala Asn Glu Thr Ser Leu Leu Leu 20 25 30 Gly Val GluLys Asp Ile Trp Tyr Ile Lys Asp Glu Leu Lys Thr Met 35 40 45 Gln Ala PheLeu Arg Ala Ala Glu Val Met Lys Lys Lys Asp Glu Leu 50 55 60 Leu Lys ValTrp Ala Glu Gln Ile Arg Asp Leu Ser Tyr Asp Ile Glu 65 70 75 80 Asp SerLeu Asp Glu Phe Lys Val His Ile Glu Ser Gln Thr Leu Phe 85 90 95 Arg GlnLeu Val Lys Leu Arg Glu Arg His Arg Ile Ala Ile Arg Ile 100 105 110 HisAsn Leu Lys Ser Arg Val Glu Glu Val Ser Ser Arg Asn Thr Arg 115 120 125Tyr Asn Leu Val Glu Pro Ile Ser Ser Gly Thr Glu Asp Asp Met Asp 130 135140 Ser Tyr Ala Glu Asp Ile Arg Asn Gln Ser Ala Arg Asn Val Asp Glu 145150 155 160 Ala Glu Leu Val Gly Phe Ser Asp Ser Lys Lys Arg Leu Leu GluMet 165 170 175 Ile Asp Thr Asn Ala Asn Asp Gly Pro Ala Lys Val Ile CysVal Val 180 185 190 Gly Met Gly Gly Leu Gly Lys Thr Ala Leu Ser Arg LysIle Phe Glu 195 200 205 Ser Glu Glu Asp Ile Arg Lys Asn Phe Pro Cys IleAla Trp Ile Thr 210 215 220 Val Ser Gln Ser Phe His Arg Ile Glu Leu LeuLys Asp Met Ile Arg 225 230 235 240 Gln Leu Leu Gly Pro Ser Ser Leu AspGln Leu Leu Gln Glu Leu Gln 245 250 255 Gly Lys Val Val Val Gln Val HisHis Leu Ser Glu Tyr Leu Ile Glu 260 265 270 Glu Leu Lys Glu Lys Arg TyrPhe Val Ile Leu Asp Asp Leu Trp Ile 275 280 285 Leu His Asp Trp Asn TrpIle Asn Glu Ile Ala Phe Pro Lys Asn Asn 290 295 300 Lys Lys Gly Ser ArgIle Val Ile Thr Thr Arg Asn Val Asp Leu Ala 305 310 315 320 Glu Lys CysAla Thr Ala Ser Leu Val Tyr His Leu Asp Phe Leu Gln 325 330 335 Met AsnAsp Ala Ile Thr Leu Leu Leu Arg Lys Thr Asn Lys Asn His 340 345 350 GluAsp Met Glu Ser Asn Lys Asn Met Gln Lys Met Val Glu Arg Ile 355 360 365Val Asn Lys Cys Gly Arg Leu Pro Leu Ala Ile Leu Thr Ile Gly Ala 370 375380 Val Leu Ala Thr Lys His Val Ser Glu Trp Glu Lys Phe Tyr Glu Gln 385390 395 400 Leu Pro Ser Glu Leu Glu Ile Asn Pro Ser Leu Glu Ala Leu ArgArg 405 410 415 Met Val Thr Leu Gly Tyr Asn His Leu Pro Ser His Leu LysPro Cys 420 425 430 Phe Leu Tyr Leu Ser Ile Phe Pro Glu Asp Phe Glu IleLys Arg Asn 435 440 445 Arg Leu Val Gly Arg Trp Ile Ala Glu Gly Phe ValArg Pro Lys Val 450 455 460 Gly Met Thr Thr Lys Asp Val Gly Glu Ser TyrPhe Asn Glu Leu Ile 465 470 475 480 Asn Arg Ser Met Ile Gln Arg Ser ArgVal Gly Ile Ala Gly Lys Ile 485 490 495 Lys Thr Cys Arg Ile His Asp IleIle Arg Asp Ile Thr Val Ser Ile 500 505 510 Ser Arg Gln Glu Asn Phe ValLeu Leu Pro Met Gly Asp Gly Ser Asp 515 520 525 Leu Val Gln Glu Asn ThrArg His Ile Ala Phe His Gly Ser Met Ser 530 535 540 Cys Lys Thr Gly LeuAsp Trp Ser Ile Ile Arg Ser Leu Ala Ile Phe 545 550 555 560 Gly Asp ArgPro Lys Ser Leu Ala His Ala Val Cys Leu Asp Gln Leu 565 570 575 Arg MetLeu Arg Val Leu Asp Leu Glu Asp Val Thr Phe Leu Ile Thr 580 585 590 GlnLys Asp Phe Asp Arg Ile Ala Leu Leu Cys His Leu Lys Tyr Leu 595 600 605Ser Ile Gly Tyr Ser Ser Ser Ile Tyr Ser Leu Pro Arg Ser Ile Gly 610 615620 Lys Leu Gln Gly Leu Gln Thr Leu Asn Met Leu Arg Thr Tyr Ile Ala 625630 635 640 Ala Leu Pro Ser Glu Ile Ser Lys Leu Gln Cys Leu His Thr LeuArg 645 650 655 Cys Ser Arg Lys Phe Val Tyr Asp Asn Phe Ser Leu Asn HisPro Met 660 665 670 Lys Cys Ile Thr Asn Thr Ile Cys Leu Pro Lys Val PheThr Pro Leu 675 680 685 Val Ser Arg Asp Asp Arg Ala Lys Gln Ile Ala GluLeu His Met Ala 690 695 700 Thr Lys Ser Cys Trp Ser Glu Ser Phe Gly ValLys Val Pro Lys Gly 705 710 715 720 Ile Gly Lys Leu Arg Asp Leu Gln ValLeu Glu Tyr Val Asp Ile Arg 725 730 735 Arg Thr Ser Ser Arg Ala Ile LysGlu Leu Gly His Leu Ser Lys Leu 740 745 750 Arg Lys Leu Gly Val Ile ThrLys Gly Ser Thr Lys Glu Lys Cys Lys 755 760 765 Ile Leu Tyr Ala Ala IleGlu Lys Leu Ser Ser Leu Gln Ser Leu Tyr 770 775 780 Val Asn Ala Ala LeuLeu Ser Asp Ile Glu Thr Leu Glu Cys Leu Asp 785 790 795 800 Ser Ile SerSer Pro Pro Pro Leu Leu Arg Thr Leu Gly Leu Asn Gly 805 810 815 Ser LeuGlu Glu Met Pro Asn Trp Ile Glu Gln Leu Thr His Leu Lys 820 825 830 LysIle Tyr Leu Leu Arg Ser Lys Leu Lys Glu Gly Lys Thr Met Leu 835 840 845Ile Leu Gly Ala Leu Pro Asn Leu Met Val Leu Tyr Leu Tyr Trp Asn 850 855860 Ala Tyr Leu Gly Glu Lys Leu Val Phe Lys Thr Gly Ala Phe Pro Asn 865870 875 880 Leu Arg Thr Leu Arg Ile Tyr Glu Leu Asp Gln Leu Arg Glu MetArg 885 890 895 Phe Glu Asp Gly Ser Ser Pro Leu Leu Glu Lys Ile Glu IleSer Cys 900 905 910 Cys Arg Leu Glu Ser Gly Ile Ile Gly Ile Ile His LeuPro Arg Leu 915 920 925 Lys Glu Ile Ser Leu Glu Tyr Lys Ser Lys Val AlaArg Leu Gly Gln 930 935 940 Leu Glu Gly Glu Val Asn Thr His Pro Asn ArgPro Val Leu Arg Met 945 950 955 960 Asp Ser Asp Arg Arg Asp His Asp LeuGly Ala Glu Ala Glu Gly Ser 965 970 975 Ser Ile Glu Val Gln Thr Ala AspPro Val Pro Asp Ala Glu Gly Ser 980 985 990 Val Thr Val Ala Val Glu AlaThr Asp Pro Leu Pro Glu Gln Glu Gly 995 1000 1005 Glu Ser Ser Gln SerGln Val Ile Thr Leu Thr Thr Asn Asp Arg 1010 1015 1020 Ser Val Thr ProTyr Met Ala Ala 1025 1030 90 3220 DNA Oryza minuta 90 atggcggatacagtactcag cattgcaaag tccctggtgg gaagtgctgt aagcaaggtt 60 gcttcggttgccgcagacaa gatgatcatg ctgctgggag tgcagaagga gatatggttc 120 atcaaagatgagctacaaac gatacaagca tttttgattg ctgccgaagc atcaaagaaa 180 agcatactattgaaggtttg ggtgcagcaa gtaagggatc tttcctatga cattgaagat 240 tgccttgatgaatttacagt tcatgtgggc agccaaaact tgtcgaggca gttgatgaag 300 ctaaaggatcgccatcggat tgccatccag atccgcaatc tcaggacaag aattgaagaa 360 gtaagcactaggaacatacg ctacaactta atagagaatg acctcacctg caccactact 420 gatgagaggaatttatttat ggaagacatt cgcaaccaat aagctaacaa cattgaggaa 480 gctgatcttgtgggtttttc tggacccaaa agagagttgc ttgatcttat agatgtccat 540 gccaatgacggacctacaaa agttgtatgt gttgtcggta tgggtggttt gggtaagact 600 actattgcaaggaaaattta tgaaagcaaa gaggacattg caaagaattt ttcttgctgt 660 gcttggattactgtttcaca gtcctttgtt agggtggaac tactcaagga tttgatggtg 720 aaactttttggagaggaagt actgaagaag cggctgagag aactcgaagg gaaggttcca 780 caagtagacgacctcgccag ctacctcagg acagagttaa atgaaaggag gtactttgtt 840 gtgcttgataacgtgtggag tacagattca tggaaatgga ttaatagtat tgccttccct 900 agaaataacaataaagggag ccgggtgata gtaacaacaa gagatgttgg cttaactaag 960 gagtgtacttctgaattgct tatctaccag cttaaacccc tagaaataaa ctatgcaaaa 1020 gagttgcttctacggaaagc aaataaagca ataggagata tggaaagtga taaaaagatg 1080 agtgacattataactaaaat aataaagaag tgtggctatt taccgctggc tatactcaca 1140 ataggaggcgtgctttccac caaagagata agagagtggg aaacttttta tagtcagata 1200 ccttcagagcttgagagcaa cccaaacctt gaagcaatga gaaggatagt gaccctaagt 1260 tacaactacttaccgtctca tcttaagcaa tgctttttgt atctaagcat atttcctgag 1320 gattttgaaattaataggaa ccgtctggta aatagatgga tggcagaggg gtttattaaa 1380 gctaggactaatatgactat tgaagatgtt gggaaaagtt actttaaaga acttatcaac 1440 cgtagcatgattcagccatc aagagcgggt atacgaggag attttaagag ctgtcgagtc 1500 cacgacatcatgcgtgatat tacaatttcg atttctagag aagaaaattt cacactctta 1560 cccgatggcactgactatga tgtagtacat gggaacactc ggcacataga atttcacggg 1620 agtaagtattgctctgaaac aagcttggac tggagcatta tacggtcatt aactatgttt 1680 ggtgagaggtccgtagaact agagcattca gtttgttcat ctcagttgag gatgttacgg 1740 gtcttggatctaacagatgc acaattttct atcacacaaa atgatgtcga caacatagtg 1800 ctcttgtgccacttgaaata cctatgcatt aagatacaga taccgttcac catatattta 1860 ttcacttccacaatccatag ctagactgca tggtctgcag acattggact tgggtcagac 1920 gtacatttcaacactgccaa ctcagattac taacttcgga gtctccgtag ccttcgatgc 1980 atgaaagaatatttttcttc ttctttaaca acatatttaa ctaacacatt atgcctgccc 2040 atgatattcacacctttcgt tagtacctcg gatcgttctg aaacaattgc taaattgcac 2100 atggccaccaaaggcttccg ttcaaaatca aatggtgtca aggtaccaaa aggaatatgt 2160 aataaaaaaactcgacttac aaatattgga ggtagtggat attagaagga ctagcagtag 2220 agcaatcaaagagttggggc agttaagcaa gctgaggaaa ttatgtgtgg taacaaaggg 2280 atccacaaaggaaaaatgtg agatactcta tacagctatc cagaagctct gtttcctaca 2340 atctctccatgtgaatgctg tgggattttc aggtattgga acacttcagt gtatagattc 2400 tatttcatctcctcctcccc tactgaggac actcaggttg aatggaagtc ttgaggagat 2460 gcctaactggattgagcagc tcacgcacct gatgaagttc aacttatgga ggagcaaact 2520 aaaagaaggtaaaaccatgt tggtacttgc ggcgttgccc aacctcatgg tcctttatct 2580 tcattccaatgcttaccatg gggagaagct agtattcaaa atgggagcat tcccaaatct 2640 tagaacattttcgatttaca atttggagca gctaagagag attagatttg aggacggcag 2700 ctcaatcttgttggaaaaga tagaaatatt caggggttgg aatcagggat tgttggtatc 2760 attcaccttccaaggctcaa ggagatttca cttggatacg gaagtaaagt ggctaggctt 2820 ggtcagctggagggagaagt gcgcacacac ccaaatcacc ccgtgctgcg aatgagggag 2880 gaccgaagtgatcacgacct tgcttgtgac gccgaaggat cccctgttga agtggaagca 2940 acagatcctgtgagagctca cagttgcagg tgatcacgtt gacaacgaac gacaggtcag 3000 tcactccctacacggcatct taatgaactt gttttatcct cttgtgagat cgatgatttt 3060 aactcaccctttcatctctc tcgttttctt aacctaacag cgaagagata agcacaactt 3120 aagctggtttgatcaagtga tgatctcctc ctccattggc atctccggtc gtccctgctt 3180 ctgcggctgcgcacctcgct gctccgagga ggggtgctga 3220 91 1054 PRT Oryza minuta 91 MetAla Asp Thr Val Leu Ser Ile Ala Lys Ser Leu Val Gly Ser Ala 1 5 10 15Val Ser Lys Val Ala Ser Val Ala Ala Asp Lys Met Ile Met Leu Leu 20 25 30Gly Val Gln Lys Glu Ile Trp Phe Ile Lys Asp Glu Leu Gln Thr Ile 35 40 45Gln Ala Phe Leu Ile Ala Ala Glu Ala Ser Lys Lys Ser Ile Leu Leu 50 55 60Lys Val Trp Val Gln Gln Val Arg Asp Leu Ser Tyr Asp Ile Glu Asp 65 70 7580 Cys Leu Asp Glu Phe Thr Val His Val Gly Ser Gln Asn Leu Ser Arg 85 9095 Gln Leu Met Lys Leu Lys Asp Arg His Arg Ile Ala Ile Gln Ile Arg 100105 110 Asn Leu Arg Thr Arg Ile Glu Glu Val Ser Thr Arg Asn Ile Arg Tyr115 120 125 Asn Leu Ile Glu Asn Asp Leu Thr Cys Thr Thr Thr Asp Glu ArgAsn 130 135 140 Leu Phe Met Glu Asp Ile Arg Asn Gln Ala Asn Asn Ile GluGlu Ala 145 150 155 160 Asp Leu Val Gly Phe Ser Gly Pro Lys Arg Glu LeuLeu Asp Leu Ile 165 170 175 Asp Val His Ala Asn Asp Gly Pro Thr Lys ValVal Cys Val Val Gly 180 185 190 Met Gly Gly Leu Gly Lys Thr Thr Ile AlaArg Lys Ile Tyr Glu Ser 195 200 205 Lys Glu Asp Ile Ala Lys Asn Phe SerCys Cys Ala Trp Ile Thr Val 210 215 220 Ser Gln Ser Phe Val Arg Val GluLeu Leu Lys Asp Leu Met Val Lys 225 230 235 240 Leu Phe Gly Glu Glu ValLeu Lys Lys Arg Leu Arg Glu Leu Glu Gly 245 250 255 Lys Val Pro Gln ValAsp Asp Leu Ala Ser Tyr Leu Arg Thr Glu Leu 260 265 270 Asn Glu Arg ArgTyr Phe Val Val Leu Asp Asn Val Trp Ser Thr Asp 275 280 285 Ser Trp LysTrp Ile Asn Ser Ile Ala Phe Pro Arg Arg Asn Asn Asn 290 295 300 Lys GlySer Arg Val Ile Val Thr Thr Arg Asp Val Gly Leu Thr Lys 305 310 315 320Glu Cys Thr Ser Glu Leu Leu Ile Tyr Gln Leu Lys Pro Leu Glu Ile 325 330335 Asn Tyr Ala Lys Glu Leu Leu Leu Arg Lys Ala Asn Lys Ala Ile Gly 340345 350 Asp Met Glu Ser Asp Lys Met Ser Asp Ile Ile Thr Lys Ile Ile Lys355 360 365 Lys Cys Gly Tyr Leu Pro Leu Ala Ile Leu Thr Ile Gly Gly ValLeu 370 375 380 Ser Thr Lys Glu Ile Arg Glu Trp Glu Thr Phe Tyr Ser GlnIle Pro 385 390 395 400 Ser Glu Leu Glu Ser Asn Pro Asn Leu Glu Ala MetArg Arg Ile Val 405 410 415 Thr Leu Ser Tyr Asn Tyr Leu Pro Ser His LeuLys Gln Cys Phe Leu 420 425 430 Tyr Leu Ser Ile Phe Pro Glu Asp Phe GluIle Asn Arg Asn Arg Leu 435 440 445 Val Asn Arg Trp Met Ala Glu Gly PheIle Lys Ala Arg Thr Asn Met 450 455 460 Thr Ile Glu Asp Val Gly Lys SerTyr Phe Lys Glu Leu Ile Asn Arg 465 470 475 480 Ser Met Ile Gln Pro SerArg Ala Gly Ile Arg Gly Asp Phe Lys Ser 485 490 495 Cys Arg Val His AspIle Met Arg Asp Ile Thr Ile Ser Ile Ser Arg 500 505 510 Glu Glu Asn PheThr Leu Leu Pro Asp Gly Thr Asp Tyr Asp Val Val 515 520 525 His Gly AsnThr Arg His Ile Glu Phe His Gly Ser Lys Tyr Cys Ser 530 535 540 Glu ThrSer Leu Asp Trp Ser Ile Ile Arg Ser Leu Thr Met Phe Gly 545 550 555 560Glu Arg Ser Val Glu Leu Glu His Ser Val Cys Ser Ser Gln Leu Arg 565 570575 Met Leu Arg Val Leu Asp Leu Thr Asp Ala Gln Phe Ser Ile Thr Gln 580585 590 Asn Asp Val Asp Asn Ile Val Leu Leu Cys His Leu Lys Tyr Leu Cys595 600 605 Ile Lys Ile Gln Ile Pro Phe Thr Ile Tyr Leu Phe Thr Ser ThrIle 610 615 620 His Ser Thr Ala Trp Ser Ala Asp Ile Gly Leu Gly Ser AspVal His 625 630 635 640 Phe Asn Thr Ala Asn Ser Asp Tyr Leu Arg Ser LeuArg Ser Leu Arg 645 650 655 Cys Met Lys Glu Tyr Phe Ser Ser Ser Leu ThrThr Thr Leu Thr Asn 660 665 670 Thr Leu Cys Leu Pro Met Ile Phe Thr ProPhe Val Ser Thr Ser Asp 675 680 685 Arg Ser Glu Thr Ile Ala Lys Leu HisMet Ala Thr Lys Gly Phe Arg 690 695 700 Ser Lys Ser Asn Gly Val Lys ValPro Lys Gly Ile Cys Asn Lys Lys 705 710 715 720 Thr Arg Leu Thr Asn IleGly Gly Ser Gly Tyr Lys Asp Gln Ser Asn 725 730 735 Gln Arg Val Gly AlaVal Lys Gln Ala Glu Glu Ile Met Cys Gly Asn 740 745 750 Lys Gly Ile HisLys Gly Lys Met Asp Thr Leu Tyr Ser Tyr Pro Glu 755 760 765 Ala Leu PhePro Thr Ile Ser Pro Cys Glu Cys Cys Gly Glu Phe Arg 770 775 780 Tyr TrpAsn Thr Ser Val Tyr Arg Phe Tyr Phe Ile Ser Ser Ser Pro 785 790 795 800Thr Glu Asp Thr Gln Val Glu Arg Lys Ser Gly Asp Ala Leu Asp Ala 805 810815 Ala His Ala Pro Asp Glu Val Gln Leu Met Glu Glu Gln Thr Lys Arg 820825 830 Arg Asn His Val Gly Thr Cys Gly Val Ala Gln Pro His Gly Pro Leu835 840 845 Ser Ser Phe Gln Cys Leu Pro Trp Gly Glu Ala Ser Ile Gln AsnGly 850 855 860 Ser Ile Pro Lys Ser Asn Ile Phe Asp Leu Gln Phe Gly AlaAla Lys 865 870 875 880 Arg Asp Ile Gly Arg Gln Leu Asn Leu Val Gly LysAsp Arg Asn Ile 885 890 895 Gln Gly Leu Glu Ser Gly Ile Val Gly Ile IleHis Leu Pro Arg Leu 900 905 910 Lys Glu Ile Ser Leu Gly Tyr Gly Ser LysVal Ala Arg Leu Gly Gln 915 920 925 Leu Glu Gly Glu Val Arg Thr His ProAsn His Pro Val Leu Arg Met 930 935 940 Arg Glu Asp Arg Ser Asp His AspLeu Ala Cys Asp Ala Glu Gly Ser 945 950 955 960 Pro Val Glu Val Glu AlaThr Asp Pro Val Arg Ala His Ser Cys Arg 965 970 975 Ser Arg Gln Arg ThrThr Gly Gln Ser Leu Pro Thr Arg His Leu Asn 980 985 990 Glu Leu Val LeuSer Ser Cys Glu Ile Asp Asp Phe Asn Ser Pro His 995 1000 1005 Leu SerArg Phe Leu Asn Leu Thr Ala Lys Arg Ala Gln Leu Lys 1010 1015 1020 LeuVal Ser Ser Asp Asp Leu Leu Leu His Trp His Leu Arg Ser 1025 1030 1035Ser Leu Leu Leu Arg Leu Arg Thr Ser Leu Leu Arg Gly Gly Val 1040 10451050 Leu 92 3276 DNA Oryza minuta CDS (1)..(3276) 92 atg gcg gag acg gtgctg agc atg gcg agg tcg ctg gtg ggc agc gcc 48 Met Ala Glu Thr Val LeuSer Met Ala Arg Ser Leu Val Gly Ser Ala 1 5 10 15 atc agc aag gcc gcctcc gct gct gcc gac gag acc agc ctc ctg ctg 96 Ile Ser Lys Ala Ala SerAla Ala Ala Asp Glu Thr Ser Leu Leu Leu 20 25 30 ggc gtc gag aaa gac atctgg tat atc aaa gat gag ttg aaa ata atg 144 Gly Val Glu Lys Asp Ile TrpTyr Ile Lys Asp Glu Leu Lys Ile Met 35 40 45 cag gca ttc ctt aga gct gcagaa gtt atg aaa aag aaa gac gaa cta 192 Gln Ala Phe Leu Arg Ala Ala GluVal Met Lys Lys Lys Asp Glu Leu 50 55 60 tta aag gtt tgg gca gag caa atacgt gac ctg tta tat gac att gaa 240 Leu Lys Val Trp Ala Glu Gln Ile ArgAsp Leu Leu Tyr Asp Ile Glu 65 70 75 80 gat tcc ctt gat gaa ttt aag gtccat gtt gaa agt caa acc cta ttt 288 Asp Ser Leu Asp Glu Phe Lys Val HisVal Glu Ser Gln Thr Leu Phe 85 90 95 cgt cag tat atc aaa gat gag cta aaaacg atg caa gca ttc ctt aga 336 Arg Gln Tyr Ile Lys Asp Glu Leu Lys ThrMet Gln Ala Phe Leu Arg 100 105 110 gct gct gaa ctt atg aaa aag aaa gatgaa cta tta aag gtt tgg gca 384 Ala Ala Glu Leu Met Lys Lys Lys Asp GluLeu Leu Lys Val Trp Ala 115 120 125 gag caa ata cgt gac ctg tca tat gacatt gaa gat tcc ctt gat gaa 432 Glu Gln Ile Arg Asp Leu Ser Tyr Asp IleGlu Asp Ser Leu Asp Glu 130 135 140 ttt aag gtc cat att gaa agc caa acccta ttt cgt cag ttg gtg aaa 480 Phe Lys Val His Ile Glu Ser Gln Thr LeuPhe Arg Gln Leu Val Lys 145 150 155 160 ctc aga gaa cgc cac cga att gctatc cgt atc cac aac ctt aaa tca 528 Leu Arg Glu Arg His Arg Ile Ala IleArg Ile His Asn Leu Lys Ser 165 170 175 aga gtt gaa gaa gtg agt agc aggaac aca cgc tac agt tta gtc aag 576 Arg Val Glu Glu Val Ser Ser Arg AsnThr Arg Tyr Ser Leu Val Lys 180 185 190 cct att tcc tct ggc aca gag attgac atg gat tcc tat gca gaa gac 624 Pro Ile Ser Ser Gly Thr Glu Ile AspMet Asp Ser Tyr Ala Glu Asp 195 200 205 att cgt aat cag tca gct cgc aatgtg gat gaa gct gag ctt gtt ggg 672 Ile Arg Asn Gln Ser Ala Arg Asn ValAsp Glu Ala Glu Leu Val Gly 210 215 220 ttt tct gac tcc aag aaa agg ttgctt gaa atg atc gat acc aat gct 720 Phe Ser Asp Ser Lys Lys Arg Leu LeuGlu Met Ile Asp Thr Asn Ala 225 230 235 240 aat gat ggt ccg gcc aaa gtaatc tgt gtt gtt ggg atg ggt ggt tta 768 Asn Asp Gly Pro Ala Lys Val IleCys Val Val Gly Met Gly Gly Leu 245 250 255 ggc aag aca gct ctt tcg aggaag atc ttt gaa agc gaa gaa gac att 816 Gly Lys Thr Ala Leu Ser Arg LysIle Phe Glu Ser Glu Glu Asp Ile 260 265 270 agg aag aac ttc cct tgc aatgct tgg att aca gtg tca caa tca ttt 864 Arg Lys Asn Phe Pro Cys Asn AlaTrp Ile Thr Val Ser Gln Ser Phe 275 280 285 cac agg att gag cta ctc aaagat atg ata cgc caa ctt cta ggt ccc 912 His Arg Ile Glu Leu Leu Lys AspMet Ile Arg Gln Leu Leu Gly Pro 290 295 300 aat tct ctg aaa caa ctc ttgcaa gaa ttg caa ggg aag gtg gtg gtg 960 Asn Ser Leu Lys Gln Leu Leu GlnGlu Leu Gln Gly Lys Val Val Val 305 310 315 320 caa gta cat cat ctt tctgag tac ctg ata gaa gag ctc aag gag aag 1008 Gln Val His His Leu Ser GluTyr Leu Ile Glu Glu Leu Lys Glu Lys 325 330 335 agg tac ttt gtt att ctagat gat cta tgg att tta cat gat tgg aat 1056 Arg Tyr Phe Val Ile Leu AspAsp Leu Trp Ile Leu His Asp Trp Asn 340 345 350 tgg ata aat gaa att gcattt cct aag aac aat aag aag ggc agt cga 1104 Trp Ile Asn Glu Ile Ala PhePro Lys Asn Asn Lys Lys Gly Ser Arg 355 360 365 ata gta ata acc act cggaat gtt gat cta gcg gag aag tgt gcc aca 1152 Ile Val Ile Thr Thr Arg AsnVal Asp Leu Ala Glu Lys Cys Ala Thr 370 375 380 gcc tca ctg gtg tac cacctt gat ttc ttg cag atg aac gat gcc ata 1200 Ala Ser Leu Val Tyr His LeuAsp Phe Leu Gln Met Asn Asp Ala Ile 385 390 395 400 aca ttg cta cta agaaaa aca aat aaa aat cat gaa gac atg gaa tca 1248 Thr Leu Leu Leu Arg LysThr Asn Lys Asn His Glu Asp Met Glu Ser 405 410 415 aat aaa aat atg caaaag atg gtt gaa cga att gta aat aaa tgt ggt 1296 Asn Lys Asn Met Gln LysMet Val Glu Arg Ile Val Asn Lys Cys Gly 420 425 430 cgt cta cca tta gcaata ctt aca ata gga gct gtg ctt gca act aaa 1344 Arg Leu Pro Leu Ala IleLeu Thr Ile Gly Ala Val Leu Ala Thr Lys 435 440 445 cat gtg tca gaa tgggag aaa ttc tat gaa cag ctt cct tca gaa cta 1392 His Val Ser Glu Trp GluLys Phe Tyr Glu Gln Leu Pro Ser Glu Leu 450 455 460 gaa ata aac cca agcctg gaa gct ttg agg aga atg gtg acc cta ggt 1440 Glu Ile Asn Pro Ser LeuGlu Ala Leu Arg Arg Met Val Thr Leu Gly 465 470 475 480 tac aac cac ctacca tcc cat ctg aaa cca tgc ttt ttg tat cta agt 1488 Tyr Asn His Leu ProSer His Leu Lys Pro Cys Phe Leu Tyr Leu Ser 485 490 495 atc ttt cct gaggat ttt gaa atc aaa agg aat cgt ctg gta ggt aga 1536 Ile Phe Pro Glu AspPhe Glu Ile Lys Arg Asn Arg Leu Val Gly Arg 500 505 510 tgg ata gca gaaggg ttt gtt aga ccg cag gtt ggg atg acg act aag 1584 Trp Ile Ala Glu GlyPhe Val Arg Pro Gln Val Gly Met Thr Thr Lys 515 520 525 gat gtc gga gaaagt tac ttt aat gag cta atc agc cga agt atg att 1632 Asp Val Gly Glu SerTyr Phe Asn Glu Leu Ile Ser Arg Ser Met Ile 530 535 540 caa cga tca agagtg ggc ata tca gga aaa att aag act tgt cga atc 1680 Gln Arg Ser Arg ValGly Ile Ser Gly Lys Ile Lys Thr Cys Arg Ile 545 550 555 560 cat gat atcatc cgt gat atc aca gtt tca atc tcg aga cag gaa aat 1728 His Asp Ile IleArg Asp Ile Thr Val Ser Ile Ser Arg Gln Glu Asn 565 570 575 ttt gta ttgtta cca atg gga gat ggc tct gat tta gtt cag gaa aac 1776 Phe Val Leu LeuPro Met Gly Asp Gly Ser Asp Leu Val Gln Glu Asn 580 585 590 act cgc cacata gca ttc cat ggg agt atg tcc tgc aaa aca gga ttg 1824 Thr Arg His IleAla Phe His Gly Ser Met Ser Cys Lys Thr Gly Leu 595 600 605 gat tgg agcatt att cga tca tta gct att ttt ggt gac aga ccc aag 1872 Asp Trp Ser IleIle Arg Ser Leu Ala Ile Phe Gly Asp Arg Pro Lys 610 615 620 agt cta gcacat gca gtt tgt cta gat caa ttg agg atg tta cgg gtc 1920 Ser Leu Ala HisAla Val Cys Leu Asp Gln Leu Arg Met Leu Arg Val 625 630 635 640 ttg gatctt gaa gat gtg aca ttc tta atc act caa aaa gat ttc gac 1968 Leu Asp LeuGlu Asp Val Thr Phe Leu Ile Thr Gln Lys Asp Phe Asp 645 650 655 cgt attgca ttg ttg tgc cac ttg aaa tac ttg agt att gga tat tcg 2016 Arg Ile AlaLeu Leu Cys His Leu Lys Tyr Leu Ser Ile Gly Tyr Ser 660 665 670 tca tccata tat tca ctt ccc aga tcc att ggt aaa cta cag ggc cta 2064 Ser Ser IleTyr Ser Leu Pro Arg Ser Ile Gly Lys Leu Gln Gly Leu 675 680 685 caa actttg aac atg ctg aga aca tac att gca gca cta cca agt gag 2112 Gln Thr LeuAsn Met Leu Arg Thr Tyr Ile Ala Ala Leu Pro Ser Glu 690 695 700 atc agtaaa ctc caa tgt ctg cat act ctt cgt tgt agt aga aag ttt 2160 Ile Ser LysLeu Gln Cys Leu His Thr Leu Arg Cys Ser Arg Lys Phe 705 710 715 720 gtttat gac aac ttt agt cta aac cac cca atg aag tgc ata act aac 2208 Val TyrAsp Asn Phe Ser Leu Asn His Pro Met Lys Cys Ile Thr Asn 725 730 735 acaata tgc ctg cct aaa gta ttc aca cct tta gtt agt cgc gat gat 2256 Thr IleCys Leu Pro Lys Val Phe Thr Pro Leu Val Ser Arg Asp Asp 740 745 750 cgtgca aaa caa att gct gaa ttg cac atg gcc acc aaa agt tgc tgg 2304 Arg AlaLys Gln Ile Ala Glu Leu His Met Ala Thr Lys Ser Cys Trp 755 760 765 tctgaa tca ttc ggt gtg aag gta ccc aaa gga ata ggt aag ttg cga 2352 Ser GluSer Phe Gly Val Lys Val Pro Lys Gly Ile Gly Lys Leu Arg 770 775 780 gacttg cag gtt cta gag tat gta gat atc agg cgg acc agt agt aga 2400 Asp LeuGln Val Leu Glu Tyr Val Asp Ile Arg Arg Thr Ser Ser Arg 785 790 795 800gca atc aaa gag ctg ggg cac tta agc aag ttg agg aaa tta ggt gtg 2448 AlaIle Lys Glu Leu Gly His Leu Ser Lys Leu Arg Lys Leu Gly Val 805 810 815ata aca aaa ggc tcg aca aag gaa aaa tgt aag ata ctt tat gca gcc 2496 IleThr Lys Gly Ser Thr Lys Glu Lys Cys Lys Ile Leu Tyr Ala Ala 820 825 830att gag aag ctc tct tcc ctc caa tct ctc tat gtg aat gct gcg tta 2544 IleGlu Lys Leu Ser Ser Leu Gln Ser Leu Tyr Val Asn Ala Ala Leu 835 840 845tta tca gat att gaa aca ctt gag tgc cta gat tct att tca tct cct 2592 LeuSer Asp Ile Glu Thr Leu Glu Cys Leu Asp Ser Ile Ser Ser Pro 850 855 860cct ccc cta ctg agg aca ctc ggg ttg aat gga agt ctt gaa gag atg 2640 ProPro Leu Leu Arg Thr Leu Gly Leu Asn Gly Ser Leu Glu Glu Met 865 870 875880 cct aac tgg att gag cag ctc act cac ctg aag aag atc tac tta ttg 2688Pro Asn Trp Ile Glu Gln Leu Thr His Leu Lys Lys Ile Tyr Leu Leu 885 890895 agg agc aaa cta aag gaa ggt aaa acc atg ctg ata ctt ggg gca ttg 2736Arg Ser Lys Leu Lys Glu Gly Lys Thr Met Leu Ile Leu Gly Ala Leu 900 905910 ccc aac ctc atg gtc ctt tat ctt tat tgg aat gct tac ctt ggg gag 2784Pro Asn Leu Met Val Leu Tyr Leu Tyr Trp Asn Ala Tyr Leu Gly Glu 915 920925 aag cta gta ttc aaa acg gga gca ttc cca aat ctt aga aca ctt cgt 2832Lys Leu Val Phe Lys Thr Gly Ala Phe Pro Asn Leu Arg Thr Leu Arg 930 935940 att tac gaa ttg gat cag cta aga gag atg aga ttt gag gat ggc agc 2880Ile Tyr Glu Leu Asp Gln Leu Arg Glu Met Arg Phe Glu Asp Gly Ser 945 950955 960 tca ccc ctg ttg gaa aag ata gaa atc tct tgc tgc agg ttg gaa tca2928 Ser Pro Leu Leu Glu Lys Ile Glu Ile Ser Cys Cys Arg Leu Glu Ser 965970 975 ggg att att ggt atc att cac ctt cca agg ctc aag gag att tca ctt2976 Gly Ile Ile Gly Ile Ile His Leu Pro Arg Leu Lys Glu Ile Ser Leu 980985 990 gaa tac aaa agt aaa gtg gct agg ctt ggt cag ctg gag gga gaa gtg3024 Glu Tyr Lys Ser Lys Val Ala Arg Leu Gly Gln Leu Glu Gly Glu Val 9951000 1005 aac aca cac cca aat cgc ccc gtg ctg cga atg gac agt gac cga3069 Asn Thr His Pro Asn Arg Pro Val Leu Arg Met Asp Ser Asp Arg 10101015 1020 agg gat cac gac ctg ggg gct gaa gcc gaa gga tct tct ata gaa3114 Arg Asp His Asp Leu Gly Ala Glu Ala Glu Gly Ser Ser Ile Glu 10251030 1035 gtg caa aca gca gat cct gtt cct gat gcc gaa gga tca gtc act3159 Val Gln Thr Ala Asp Pro Val Pro Asp Ala Glu Gly Ser Val Thr 10401045 1050 gta gca gtg gaa gca acg gat ccc ctt ccc gag cag gag gga gag3204 Val Ala Val Glu Ala Thr Asp Pro Leu Pro Glu Gln Glu Gly Glu 10551060 1065 agc tcg cag tcg cag gtg atc acg ttg acg acg aac gat agc gaa3249 Ser Ser Gln Ser Gln Val Ile Thr Leu Thr Thr Asn Asp Ser Glu 10701075 1080 gag ata ggc aca gct caa gct ggc tga 3276 Glu Ile Gly Thr AlaGln Ala Gly 1085 1090 93 1091 PRT Oryza minuta 93 Met Ala Glu Thr ValLeu Ser Met Ala Arg Ser Leu Val Gly Ser Ala 1 5 10 15 Ile Ser Lys AlaAla Ser Ala Ala Ala Asp Glu Thr Ser Leu Leu Leu 20 25 30 Gly Val Glu LysAsp Ile Trp Tyr Ile Lys Asp Glu Leu Lys Ile Met 35 40 45 Gln Ala Phe LeuArg Ala Ala Glu Val Met Lys Lys Lys Asp Glu Leu 50 55 60 Leu Lys Val TrpAla Glu Gln Ile Arg Asp Leu Leu Tyr Asp Ile Glu 65 70 75 80 Asp Ser LeuAsp Glu Phe Lys Val His Val Glu Ser Gln Thr Leu Phe 85 90 95 Arg Gln TyrIle Lys Asp Glu Leu Lys Thr Met Gln Ala Phe Leu Arg 100 105 110 Ala AlaGlu Leu Met Lys Lys Lys Asp Glu Leu Leu Lys Val Trp Ala 115 120 125 GluGln Ile Arg Asp Leu Ser Tyr Asp Ile Glu Asp Ser Leu Asp Glu 130 135 140Phe Lys Val His Ile Glu Ser Gln Thr Leu Phe Arg Gln Leu Val Lys 145 150155 160 Leu Arg Glu Arg His Arg Ile Ala Ile Arg Ile His Asn Leu Lys Ser165 170 175 Arg Val Glu Glu Val Ser Ser Arg Asn Thr Arg Tyr Ser Leu ValLys 180 185 190 Pro Ile Ser Ser Gly Thr Glu Ile Asp Met Asp Ser Tyr AlaGlu Asp 195 200 205 Ile Arg Asn Gln Ser Ala Arg Asn Val Asp Glu Ala GluLeu Val Gly 210 215 220 Phe Ser Asp Ser Lys Lys Arg Leu Leu Glu Met IleAsp Thr Asn Ala 225 230 235 240 Asn Asp Gly Pro Ala Lys Val Ile Cys ValVal Gly Met Gly Gly Leu 245 250 255 Gly Lys Thr Ala Leu Ser Arg Lys IlePhe Glu Ser Glu Glu Asp Ile 260 265 270 Arg Lys Asn Phe Pro Cys Asn AlaTrp Ile Thr Val Ser Gln Ser Phe 275 280 285 His Arg Ile Glu Leu Leu LysAsp Met Ile Arg Gln Leu Leu Gly Pro 290 295 300 Asn Ser Leu Lys Gln LeuLeu Gln Glu Leu Gln Gly Lys Val Val Val 305 310 315 320 Gln Val His HisLeu Ser Glu Tyr Leu Ile Glu Glu Leu Lys Glu Lys 325 330 335 Arg Tyr PheVal Ile Leu Asp Asp Leu Trp Ile Leu His Asp Trp Asn 340 345 350 Trp IleAsn Glu Ile Ala Phe Pro Lys Asn Asn Lys Lys Gly Ser Arg 355 360 365 IleVal Ile Thr Thr Arg Asn Val Asp Leu Ala Glu Lys Cys Ala Thr 370 375 380Ala Ser Leu Val Tyr His Leu Asp Phe Leu Gln Met Asn Asp Ala Ile 385 390395 400 Thr Leu Leu Leu Arg Lys Thr Asn Lys Asn His Glu Asp Met Glu Ser405 410 415 Asn Lys Asn Met Gln Lys Met Val Glu Arg Ile Val Asn Lys CysGly 420 425 430 Arg Leu Pro Leu Ala Ile Leu Thr Ile Gly Ala Val Leu AlaThr Lys 435 440 445 His Val Ser Glu Trp Glu Lys Phe Tyr Glu Gln Leu ProSer Glu Leu 450 455 460 Glu Ile Asn Pro Ser Leu Glu Ala Leu Arg Arg MetVal Thr Leu Gly 465 470 475 480 Tyr Asn His Leu Pro Ser His Leu Lys ProCys Phe Leu Tyr Leu Ser 485 490 495 Ile Phe Pro Glu Asp Phe Glu Ile LysArg Asn Arg Leu Val Gly Arg 500 505 510 Trp Ile Ala Glu Gly Phe Val ArgPro Gln Val Gly Met Thr Thr Lys 515 520 525 Asp Val Gly Glu Ser Tyr PheAsn Glu Leu Ile Ser Arg Ser Met Ile 530 535 540 Gln Arg Ser Arg Val GlyIle Ser Gly Lys Ile Lys Thr Cys Arg Ile 545 550 555 560 His Asp Ile IleArg Asp Ile Thr Val Ser Ile Ser Arg Gln Glu Asn 565 570 575 Phe Val LeuLeu Pro Met Gly Asp Gly Ser Asp Leu Val Gln Glu Asn 580 585 590 Thr ArgHis Ile Ala Phe His Gly Ser Met Ser Cys Lys Thr Gly Leu 595 600 605 AspTrp Ser Ile Ile Arg Ser Leu Ala Ile Phe Gly Asp Arg Pro Lys 610 615 620Ser Leu Ala His Ala Val Cys Leu Asp Gln Leu Arg Met Leu Arg Val 625 630635 640 Leu Asp Leu Glu Asp Val Thr Phe Leu Ile Thr Gln Lys Asp Phe Asp645 650 655 Arg Ile Ala Leu Leu Cys His Leu Lys Tyr Leu Ser Ile Gly TyrSer 660 665 670 Ser Ser Ile Tyr Ser Leu Pro Arg Ser Ile Gly Lys Leu GlnGly Leu 675 680 685 Gln Thr Leu Asn Met Leu Arg Thr Tyr Ile Ala Ala LeuPro Ser Glu 690 695 700 Ile Ser Lys Leu Gln Cys Leu His Thr Leu Arg CysSer Arg Lys Phe 705 710 715 720 Val Tyr Asp Asn Phe Ser Leu Asn His ProMet Lys Cys Ile Thr Asn 725 730 735 Thr Ile Cys Leu Pro Lys Val Phe ThrPro Leu Val Ser Arg Asp Asp 740 745 750 Arg Ala Lys Gln Ile Ala Glu LeuHis Met Ala Thr Lys Ser Cys Trp 755 760 765 Ser Glu Ser Phe Gly Val LysVal Pro Lys Gly Ile Gly Lys Leu Arg 770 775 780 Asp Leu Gln Val Leu GluTyr Val Asp Ile Arg Arg Thr Ser Ser Arg 785 790 795 800 Ala Ile Lys GluLeu Gly His Leu Ser Lys Leu Arg Lys Leu Gly Val 805 810 815 Ile Thr LysGly Ser Thr Lys Glu Lys Cys Lys Ile Leu Tyr Ala Ala 820 825 830 Ile GluLys Leu Ser Ser Leu Gln Ser Leu Tyr Val Asn Ala Ala Leu 835 840 845 LeuSer Asp Ile Glu Thr Leu Glu Cys Leu Asp Ser Ile Ser Ser Pro 850 855 860Pro Pro Leu Leu Arg Thr Leu Gly Leu Asn Gly Ser Leu Glu Glu Met 865 870875 880 Pro Asn Trp Ile Glu Gln Leu Thr His Leu Lys Lys Ile Tyr Leu Leu885 890 895 Arg Ser Lys Leu Lys Glu Gly Lys Thr Met Leu Ile Leu Gly AlaLeu 900 905 910 Pro Asn Leu Met Val Leu Tyr Leu Tyr Trp Asn Ala Tyr LeuGly Glu 915 920 925 Lys Leu Val Phe Lys Thr Gly Ala Phe Pro Asn Leu ArgThr Leu Arg 930 935 940 Ile Tyr Glu Leu Asp Gln Leu Arg Glu Met Arg PheGlu Asp Gly Ser 945 950 955 960 Ser Pro Leu Leu Glu Lys Ile Glu Ile SerCys Cys Arg Leu Glu Ser 965 970 975 Gly Ile Ile Gly Ile Ile His Leu ProArg Leu Lys Glu Ile Ser Leu 980 985 990 Glu Tyr Lys Ser Lys Val Ala ArgLeu Gly Gln Leu Glu Gly Glu Val 995 1000 1005 Asn Thr His Pro Asn ArgPro Val Leu Arg Met Asp Ser Asp Arg 1010 1015 1020 Arg Asp His Asp LeuGly Ala Glu Ala Glu Gly Ser Ser Ile Glu 1025 1030 1035 Val Gln Thr AlaAsp Pro Val Pro Asp Ala Glu Gly Ser Val Thr 1040 1045 1050 Val Ala ValGlu Ala Thr Asp Pro Leu Pro Glu Gln Glu Gly Glu 1055 1060 1065 Ser SerGln Ser Gln Val Ile Thr Leu Thr Thr Asn Asp Ser Glu 1070 1075 1080 GluIle Gly Thr Ala Gln Ala Gly 1085 1090 94 1355 DNA Oryza minuta CDS(1)..(1353) 94 atg gcg gat aca gta ctc agc att gca aag tcc ctg gtg ggaagt gct 48 Met Ala Asp Thr Val Leu Ser Ile Ala Lys Ser Leu Val Gly SerAla 1 5 10 15 gta agt aag gtt gct tcg gtt gct gcc gac aag atg atc ttgctg ctg 96 Val Ser Lys Val Ala Ser Val Ala Ala Asp Lys Met Ile Leu LeuLeu 20 25 30 gga gtg cag aag gag ata tgg ttc atc aaa gac gag cta caa acgata 144 Gly Val Gln Lys Glu Ile Trp Phe Ile Lys Asp Glu Leu Gln Thr Ile35 40 45 caa gca ttt ttg atg gct gcc gaa gca tca aag aaa agc ata ctg ttg192 Gln Ala Phe Leu Met Ala Ala Glu Ala Ser Lys Lys Ser Ile Leu Leu 5055 60 aag gtt tgg gtg cag caa gta agg gat cta tcc tat gac att gaa gat240 Lys Val Trp Val Gln Gln Val Arg Asp Leu Ser Tyr Asp Ile Glu Asp 6570 75 80 tgc ctt gat gaa ttt aca gtt cat gtg ggc agc caa acc ttg tcg agg288 Cys Leu Asp Glu Phe Thr Val His Val Gly Ser Gln Thr Leu Ser Arg 8590 95 cag ttg atg aag cta aag gat cgc cat cgg atc gcc gtt cag atc cgc336 Gln Leu Met Lys Leu Lys Asp Arg His Arg Ile Ala Val Gln Ile Arg 100105 110 aat ctc agg aca aga atc gaa gaa gtg agc agt agg aac aca cgc tac384 Asn Leu Arg Thr Arg Ile Glu Glu Val Ser Ser Arg Asn Thr Arg Tyr 115120 125 aac tta ata gag aat gac ctt acc agc acc act ggt gag agg aat ttc432 Asn Leu Ile Glu Asn Asp Leu Thr Ser Thr Thr Gly Glu Arg Asn Phe 130135 140 att atg gaa gac att cga aat gaa tca gct aac aat atc gag gaa gct480 Ile Met Glu Asp Ile Arg Asn Glu Ser Ala Asn Asn Ile Glu Glu Ala 145150 155 160 gag ctt gtg ggt ttt tcg gga ccc aaa aga gag ttg ctt gat cttata 528 Glu Leu Val Gly Phe Ser Gly Pro Lys Arg Glu Leu Leu Asp Leu Ile165 170 175 gat gtc cat gcc aag gat gga cct aca aag gtt gta tgt gtt gtcggt 576 Asp Val His Ala Lys Asp Gly Pro Thr Lys Val Val Cys Val Val Gly180 185 190 atg ggt ggt ttg ggt aag act act att gca agg aaa att tat gaaagc 624 Met Gly Gly Leu Gly Lys Thr Thr Ile Ala Arg Lys Ile Tyr Glu Ser195 200 205 aaa gag gac att gca aag aat ttt tct tgc tgt gct tgg att actgtt 672 Lys Glu Asp Ile Ala Lys Asn Phe Ser Cys Cys Ala Trp Ile Thr Val210 215 220 tca cag tcc ttt gtt agg ttg gaa cta ctc aag gat ttg atg gtgaaa 720 Ser Gln Ser Phe Val Arg Leu Glu Leu Leu Lys Asp Leu Met Val Lys225 230 235 240 ctt ttt gga gag gaa gta ctc aag aag cgg ctg aga gaa cttgaa ggg 768 Leu Phe Gly Glu Glu Val Leu Lys Lys Arg Leu Arg Glu Leu GluGly 245 250 255 aag gtt cca caa gta gac gac ctc gcc agc tac ctc agg acagag tta 816 Lys Val Pro Gln Val Asp Asp Leu Ala Ser Tyr Leu Arg Thr GluLeu 260 265 270 aat gaa agg agg tac ttt gtt gtg ctt gat gat gtg tgg agtaca gat 864 Asn Glu Arg Arg Tyr Phe Val Val Leu Asp Asp Val Trp Ser ThrAsp 275 280 285 tca tgg aaa tgg att aat agt att gcc ttc cct aga aat aacaat aaa 912 Ser Trp Lys Trp Ile Asn Ser Ile Ala Phe Pro Arg Asn Asn AsnLys 290 295 300 ggg agc cgg gtg ata gta aca aca aga gat gtt ggc tta gctaag gag 960 Gly Ser Arg Val Ile Val Thr Thr Arg Asp Val Gly Leu Ala LysGlu 305 310 315 320 tgt act tct gaa ttg ctt atc tac cgg ctt aaa ccc ctagaa ata acc 1008 Cys Thr Ser Glu Leu Leu Ile Tyr Arg Leu Lys Pro Leu GluIle Thr 325 330 335 tat gcg aaa gag ttg ctt cta agg aaa gca aat aaa aaaatt gaa gat 1056 Tyr Ala Lys Glu Leu Leu Leu Arg Lys Ala Asn Lys Lys IleGlu Asp 340 345 350 atg aaa agt gat aaa aag atg agt gac atc att act aaaata gta aaa 1104 Met Lys Ser Asp Lys Lys Met Ser Asp Ile Ile Thr Lys IleVal Lys 355 360 365 aag tgt ggc tat tta cca ctg gct ata ctc aca ata ggaggc gtg ctt 1152 Lys Cys Gly Tyr Leu Pro Leu Ala Ile Leu Thr Ile Gly GlyVal Leu 370 375 380 gcc acc aaa gat cca tca aga gcg ggt gta cga gga gaattt aag agc 1200 Ala Thr Lys Asp Pro Ser Arg Ala Gly Val Arg Gly Glu PheLys Ser 385 390 395 400 tgt cga atc cac gac atc atg cgt gat att gca atttcg att tct aga 1248 Cys Arg Ile His Asp Ile Met Arg Asp Ile Ala Ile SerIle Ser Arg 405 410 415 gag gaa aat ttc ata ctc tta cct gag ggc act gactat gac gta gtg 1296 Glu Glu Asn Phe Ile Leu Leu Pro Glu Gly Thr Asp TyrAsp Val Val 420 425 430 cat ggg aac act cgg cac ata gca ttt cac ggg agtaag tat tgc tct 1344 His Gly Asn Thr Arg His Ile Ala Phe His Gly Ser LysTyr Cys Ser 435 440 445 gaa aca agc tt 1355 Glu Thr Ser 450 95 451 PRTOryza minuta 95 Met Ala Asp Thr Val Leu Ser Ile Ala Lys Ser Leu Val GlySer Ala 1 5 10 15 Val Ser Lys Val Ala Ser Val Ala Ala Asp Lys Met IleLeu Leu Leu 20 25 30 Gly Val Gln Lys Glu Ile Trp Phe Ile Lys Asp Glu LeuGln Thr Ile 35 40 45 Gln Ala Phe Leu Met Ala Ala Glu Ala Ser Lys Lys SerIle Leu Leu 50 55 60 Lys Val Trp Val Gln Gln Val Arg Asp Leu Ser Tyr AspIle Glu Asp 65 70 75 80 Cys Leu Asp Glu Phe Thr Val His Val Gly Ser GlnThr Leu Ser Arg 85 90 95 Gln Leu Met Lys Leu Lys Asp Arg His Arg Ile AlaVal Gln Ile Arg 100 105 110 Asn Leu Arg Thr Arg Ile Glu Glu Val Ser SerArg Asn Thr Arg Tyr 115 120 125 Asn Leu Ile Glu Asn Asp Leu Thr Ser ThrThr Gly Glu Arg Asn Phe 130 135 140 Ile Met Glu Asp Ile Arg Asn Glu SerAla Asn Asn Ile Glu Glu Ala 145 150 155 160 Glu Leu Val Gly Phe Ser GlyPro Lys Arg Glu Leu Leu Asp Leu Ile 165 170 175 Asp Val His Ala Lys AspGly Pro Thr Lys Val Val Cys Val Val Gly 180 185 190 Met Gly Gly Leu GlyLys Thr Thr Ile Ala Arg Lys Ile Tyr Glu Ser 195 200 205 Lys Glu Asp IleAla Lys Asn Phe Ser Cys Cys Ala Trp Ile Thr Val 210 215 220 Ser Gln SerPhe Val Arg Leu Glu Leu Leu Lys Asp Leu Met Val Lys 225 230 235 240 LeuPhe Gly Glu Glu Val Leu Lys Lys Arg Leu Arg Glu Leu Glu Gly 245 250 255Lys Val Pro Gln Val Asp Asp Leu Ala Ser Tyr Leu Arg Thr Glu Leu 260 265270 Asn Glu Arg Arg Tyr Phe Val Val Leu Asp Asp Val Trp Ser Thr Asp 275280 285 Ser Trp Lys Trp Ile Asn Ser Ile Ala Phe Pro Arg Asn Asn Asn Lys290 295 300 Gly Ser Arg Val Ile Val Thr Thr Arg Asp Val Gly Leu Ala LysGlu 305 310 315 320 Cys Thr Ser Glu Leu Leu Ile Tyr Arg Leu Lys Pro LeuGlu Ile Thr 325 330 335 Tyr Ala Lys Glu Leu Leu Leu Arg Lys Ala Asn LysLys Ile Glu Asp 340 345 350 Met Lys Ser Asp Lys Lys Met Ser Asp Ile IleThr Lys Ile Val Lys 355 360 365 Lys Cys Gly Tyr Leu Pro Leu Ala Ile LeuThr Ile Gly Gly Val Leu 370 375 380 Ala Thr Lys Asp Pro Ser Arg Ala GlyVal Arg Gly Glu Phe Lys Ser 385 390 395 400 Cys Arg Ile His Asp Ile MetArg Asp Ile Ala Ile Ser Ile Ser Arg 405 410 415 Glu Glu Asn Phe Ile LeuLeu Pro Glu Gly Thr Asp Tyr Asp Val Val 420 425 430 His Gly Asn Thr ArgHis Ile Ala Phe His Gly Ser Lys Tyr Cys Ser 435 440 445 Glu Thr Ser 45096 2265 DNA Oryza minuta 96 tacgccagct ctaatacgac tcactatagg gcaaagctggtacgcctgca ggtaccggtc 60 cggaattccc gggtcgaccc acgcgtccga acaaattgctgaattgcaca tggccaccaa 120 aagttgctgg tctgaatcat tcggtgtgaa ggtacccaaaggaataggta agttgcgaga 180 cttgcaggtt ctagagtatg tagatatcag gcggaccagtagtagagcaa tcaaagagct 240 ggggcactta agcaagttga ggaaattagg tgtgataacaaaaggctcga caaaggaaaa 300 atgtaagata ctttatgcag ccattgagaa gctctcttccctccaatctc tctatgtgaa 360 tgctgcgtta ttatcagata ttgaaacact tgagtgcctagattctattt catctcctcc 420 tcccctactg aggacactcg ggttgaatgg aagtcttgaagagatgccta actggattga 480 gcagctcact cacctgaaga agatctactt attgaggagcaaactaaagg aaggtaaaac 540 catgctgata cttggggcat tgcccaacct catggtcctttatctttatt ggaatgctta 600 ccttggggag aagctagtat tcaaaacggg agcatttccaaaatcttaga acacttcgta 660 tttacgaatt ggatcagcta agagagatga gatttgaggatggcagctca cccctgttgg 720 aaaagataga aatctcttgc tgcaggttgg aatcagggattattggtatc attcaccttc 780 caaggctcaa ggagatttca cttgaataca aaagtaaagtggctaggctt ggtcagctgg 840 agggagaagt gaacacacac ccaaatcgcc ccgtgctgcgaatggacagt gaccgaaggg 900 atcacgacct gggggctgaa gccgaaggat cttctatagaagtgcaaaca gcagatcctg 960 ttcctgatgc cgaaggatca gtcactgtag cagtggaagcaacggatccc cttcccgagc 1020 aggagggaga gagctcgcag tcgcaggtga tcacgttgacgacgaacgat agcgaagaga 1080 taggcacagc tcaagctggc tgacgatctc ctcccccatcagcgtcgtca tcagcgaaca 1140 gatagggcag ggcttccctg cttctgcgtg cacctcactgctctgactct gagggacatg 1200 atgatcaatg aggcttccag ttcccaaatg tgcgcctaacacactcagtc attcccatcc 1260 aagagagttt gctagggtgg agatgcggtc atgtggatgatcaaatggac tacatcagac 1320 gcatcacact gctgcccaac ctttacgcac tgttgacaaatggagtgcag gtcctaaacc 1380 aggccagaag tttgttcagt gttcatgttc ccaaacatacattctggatg gcagaatgac 1440 ttgctctcaa tggttgaagc tgcaatcttc caggctcatttctggctgaa cccaaattgg 1500 tggcttggag gctggagctg catgacatca gagatcacaatggccacttt ttgttgccat 1560 gggtgagggt acatggatca tgccgctgag cctttactccgagctagcag aaaactggct 1620 ggctgtagat cgagttcgcc atcgccctca actttgtcgatgtgatggtg atcatggata 1680 gatgtatgtc acagcgcaaa gcggagactc caatgttcgcaaccatactg taaatgtggg 1740 aggggaaaat ttctccaagg ggaccaacca tggaacagaggaagatggag caaagtagtt 1800 gctcttgcta tcatttcaga gctcagggct gatctctagctcagatagga gtactaattc 1860 ttatgtgtgt gtatttgatc tatgatggta cgttttaagagttggaccag atattgcttg 1920 atctttacat gatgaattga tgatatgcaa aaaatggtatttgacatgga gacgcaggaa 1980 cagggtggtt cattgatctg tttgatctta atcttggagagctaggatgg aaaactgagt 2040 tggtggttat atctctacta cttttgatgt tgtccaaaactaatttgttt ctactcatgt 2100 acgtatgtaa aaaggaaaat tttaatgatt aaagaacttattgttctctt taatcagatt 2160 atttatcagt gggatttttc tttctctggt ctgtataaaaatactttttt tatcgtcaat 2220 cctcctaaaa ttcctattta taaaaaaaaa aaaaaagggcggccg 2265 97 2351 DNA Oryza minuta 97 gatcaaatga aagaaatatt tgaaaagatactaaagaaat gtggaggtct accgctagct 60 ataatcacaa taggtgcagt ccttgaagggaaagatataa aagagtggga aattttgtat 120 gctcaacttc catcagaact tgaaagcaatccaatcgctg aaccaatgaa gaaggtggta 180 acccttagtt acaattactt gccatctcatcttaagcctt gctttctgta cctttgcatc 240 tttcctgagg attttgatat ccaaaggaagcgcctagtac atagatggat tgcagagggg 300 tttattagag ctaggggtgg agtgggaattgtggatgtgg cacaaaagta ttttgatgag 360 ttgatcaacc gaagtatgat tcaagcatctagagtggata tagagggaaa tattaagagc 420 tgccgagtcc atgatatcat gcgtgatgtcatgatatcaa tatctaggga agaaaatttt 480 gtatacttga tgggtgatga tggaactagtgtagtggagg aaaatattcg ccatttagtg 540 caccatgata ctagcaagtg ttcaaatataggcatggatt ggagccatgt acggtcatta 600 actttgtttg gcaatgagag acccaaagggctatctcctt cattttgttt tccccaattg 660 aagatgctaa gagtgctgga tcttcaagatgtcaaatttg ggatgacaca aaaagatatc 720 ggaaaaatag ggttgctgcg tcacttgaaatatgtgaata ttggagggca ttcaagtatt 780 tatgcacttc ctaggtgtat aggaaaattaaaagacttgt gcactttgga cataactgac 840 agttacatta cagaactacc aactgagattagtaaattgc agagtctatg tattctccgt 900 tgtagaggaa gaccaaactc gggggattttaatctaaatg atcctaagga ttgtctaatt 960 gccttctcat gtttgcctct gcttatggctgcaaccgatt ctgatgaacg taataaaata 1020 attgctgagc tacacgtggg ttgttcaagtcaatggtctc ctaatggtgg tacatatggt 1080 gtgagggtac ctagaggaat caagaatttgaaaaggctgc aggtgctaga gacagtggat 1140 atcaatcgaa ccagcagtaa gtcagttgaagagttgggag agcttatcca gctaagaaaa 1200 ctaagtgtag taacacaagg gtccaccaaggaaaaatgca aaatactctg tacagccatt 1260 cagaagctaa cttccctcaa aactctctatctgaatgctc atggaccctt ggatactgga 1320 acacttgaat ggctacattc tatttcccatcttccttccc tgaggatcat cagattgatc 1380 ggatacatga aagagatgcc caactggtttagggagctca gacagctggt gaagattcac 1440 ttgcagaata gccaactaga ggaggataaaacaatggaga tactcggaga actgcctaat 1500 ctcatgctcc ttttccttag ttggcgagcgtatgctgggg ggaagctggt gttcagggag 1560 ggaacattct aaaatctcag ggtgctaatcattcgcaatc agaagcaact gaaagaggtg 1620 agatttgaag agggaacctc accctggatggaaaggattg atatcagaga atgcagattg 1680 acatcaggga ttgctggtat caaacaccttccgaggctta aggagatttc acttgaatac 1740 agtgctaaag tggtgaggct aggtcagctagaggttgaaa tgggcacaca ccccaatcga 1800 cccatgttgc gcctgtttgg ggagcgaagccgtcatgacc tggggaacac ccaagtaact 1860 gtcgatgaac aacaactgca gccaaaccaaacagttagcg aagaatgctt gatggatgcc 1920 gatcttgctt ctagatgatg atgatcagcttcctgttgca agctctcctg gtgatcgttc 1980 ttcttcagtt cttgacctac caccaatgctggctagcttc ctgtagtgct gatctcggag 2040 tcccctgctg aattttgcac ctccggcgccgagaggccag ttcactgtac atcccccctg 2100 agacacaatg tcggtatttg tttgattggatatgatttga ttaagtacga tgatttttgg 2160 ttctgaaaat ttgtatcatg attggttggagcgtgtgatt aggatgctta tatttgcagc 2220 agaggtcgat cgtattttgt tttgctgtgcacacaaaatg ttcgtcccaa atgtcttgtc 2280 agatattttg tagcttttga ttawmaarwamwmmrwawww awrwwwcgta atatgaaaaa 2340 aaaaaaaaaa a 2351

What is claimed is:
 1. An isolated nucleic acid comprising a nucleotidesequence that encodes an NBS protein which comprises an amino acidsequence selected from the group consisting of SEQ ID NO. 85, SEQ ID NO.87, SEQ ID NO 89, SEQ ID NO. 91, SEQ ID NO. 93, and SEQ ID NO.
 95. 2.The isolated nucleic acid of claim 1 wherein said nucleotide sequenceencodes an NBS 1, NBS2, or NBS3 protein.
 3. An isolated nucleic acidthat hybridizes under stringent conditions to the sequence set forth inSEQ ID NO. 84, 86, 88, 90, 92, or 94 or a sequence which iscomplementary thereto, wherein said isolated nucleic acid is at least 15nucleotides in length.
 4. The isolated nucleic acid of claim 3 whereinsaid nucleic acid is at least 50 nucleotides in length.
 5. The isolatednucleic acid of claim 3 wherein the isolated nucleic acid is of the samelength as SEQ ID NO. 84, SEQ ID NO. 86, SEQ ID NO. 88, SEQ ID NO. 90,SEQ ID NO. 92, or SEQ ID NO. 94, respectively.
 6. A DNA constructcomprising in the 5′ to 3′ direction: a promoter regulatory element, anucleic acid encoding an NBS protein from a rice plant resistant toinfection with Magnaporthe grisea, and a transcriptional terminatorsequence, wherein either the promoter regulatory element or thetranscriptional terminator sequence is not naturally associated withsaid nucleic acid.
 7. The DNA construct of claim 6 wherein the NBSprotein is NBS1, NBS2, or NBS3, and the nucleic acid comprises asequence which encodes the amino acid sequence set forth in SEQ ID NO.85, SEQ ID NO. 87, or SEQ ID NO. 89, respectively.
 8. A plant cellstably incorporating into its genome the nucleic acid construct of claim6.
 9. The plant cell of claim 8 wherein the nucleic acid constructcomprises a nucleotide sequence which encodes the amino acid sequenceset forth in SEQ ID NO. 84, SEQ ID NO. 86, or SEQ ID NO. 88
 10. Theplant cell of claim 8 wherein said promoter regulatory element is aconstitutive promoter or an inducible promoter.
 11. The plant cell ofclaim 8 wherein the plant cell is from a rice plant, a wheat plant or abarley plant.
 12. A transgenic plant produced from the plant cell ofclaim
 8. 13. The transgenic plant of claim 12 wherein said transgenicplant is resistant to infection by Magnaporthe grisea.
 14. Thetransgenic plant of claim 12 wherein said transgenic plant is atransgenic rice plant, a transgenic wheat plant or a transgenic barleyplant.
 15. Seed and progeny produced from the plant cell of claim 12.16. A method for the production of a plant having an NBS rice blastresistant allele, wherein said plant is a member of the grass family,comprising: (A) crossing a first plant having an NBS rice blastresistant allele with a second plant having an NBS rice blast resistantallele to produce a population of plants; (B) screening said populationof plants for a member having an NBS resistant allele with a nucleicacid molecule capable of specifically hybridizing to SEQ ID NO. 84, 86,88, 90, 92, or 94 or a complement thereof or a fragment thereof havingat least 15 nucleotides; and, (C) selecting said member for furthercrossing and selection.
 17. The method of claim 16 wherein said plant isa rice plant, a wheat plant or a barley plant.
 18. The method of claim16 wherein said screening is achieved using a polymerase chain reactionand a primer set which amplifies all or a portion of SEQ ID NO. 84, 86,88, 90, 92, or
 94. 19. The method of claim 16 wherein said screening isachieved using a probe that binds under stringent conditions to asequence of 15 contiguous nucleotides within SEQ ID NO. 84, 86, 88, 90,92, or
 94. 20. A method of identifying a plant comprising an NBS riceblast resistant allele, comprising: isolating DNA or RNA from saidplant, and assaying for the presence of one or more NBS rice blastresistant alleles by a polymerase chain reaction (PCR) which employs afirst primer that is at least 15 nucleotides in length and comprises asequence which is identical to a contiguous sequence in SEQ ID NO. 84,SEQ ID NO 86, SEQ ID NO. 88, SEQ ID NO. 90, SEQ ID NO. 92, or SEQ ID NO94 and a second primer which is the reverse complement of a contiguoussequence in SEQ ID NO. 84, SEQ ID NO 86, SEQ ID NO. 88, SEQ ID NO. 90,SEQ ID NO. 92, or SEQ ID NO 94, and wherein the product that is produceby said PCR is at least 50 nucleotides in length.